M201 0 THE OPEN UNIVERSITY
Mathematics: A Second Level Course 


Linear Mathematics Unit 0


Linear Algebra 






Prepared for the Course Team 


The Open University Press 


The Open University Press Walton Hall Milton Keynes 


First published 1978. Reprinted (with corrections) 1979. 
Copyright © 1978 The Open University 


All rights reserved. No part of this work may 
be reproduced in any form, by mimeograph 
or any other means, without permission in 
writing from the publishers.


Produced in Great Britain by 
Technical Filmsetters Europe Ltd. 
76 Great Bridgewater Street, Manchester M1 5JY 


ISBN 0 335 01125 X 


This text forms part of the correspondence element of an Open University
Second Level Course. The complete list of units in the course is given at the
end of this text.


For general availability of supporting material referred to in this text, please 
write to Open University Educational Enterprises Limited, 12 Cofferidge 
Close, Stony Stratford, Milton Keynes MK11 1BY, Great Britain. 


Further information on Open University courses may be obtained from
the Admissions Office, The Open University, P.O. Box 48, Milton Keynes
MK7 6AB.




Contents 


Set Books
Conventions

0.0 Introduction

0.1 Linearity in the Plane
0.1.1 The Set $R^2$
0.1.2 Linear Transformations on the Plane
0.1.3 Summary of Section 0.1

0.2 Geometric Vectors
0.2.0 Introduction
0.2.1 Translations
0.2.2 Addition of Geometric Vectors
0.2.3 Scalar Multiples of Geometric Vectors
0.2.4 Linear Dependence and Independence
0.2.5 Basis Vectors
0.2.6 Summary of Section 0.2

0.3 Vector Spaces
0.3.1 The Algebra of Lists
0.3.2 Vector Spaces
0.3.3 Bases and Dimension
0.3.4 Summary of Section 0.3

0.4 Mappings of Vector Spaces
0.4.0 Introduction
0.4.1 Mapping one Vector Space to Another
0.4.2 Linear Transformations
0.4.3 The Kernel
0.4.4 Properties of the Kernel
0.4.5 Summary of Section 0.4

0.5 Summary of the Unit


LM 0 


Set Books 


D. L. Kreider, R. G. Kuller, D. R. Ostberg and F. W. Perkins, An 
Introduction to Linear Analysis (Addison-Wesley, 1966). 
E. D. Nering, Linear Algebra and Matrix Theory (John Wiley, 1970). 


It is essential to have these books; the course is based on them and will not 
make sense without them. 


Conventions 


Before working through this correspondence text make sure you have read 
A Guide to the Linear Mathematics Course. Of the typographical 
conventions given in the Guide the following are the most important. 


The set books are referred to as: 
K for An Introduction to Linear Analysis 
N for Linear Algebra and Matrix Theory 

Note

This unit is not based on the set books. It has been written especially for the
benefit of students who have taken the Mathematics Foundation Course
M101 (The Open University Press, 1978).




0.0 INTRODUCTION 


This new first unit of the course gives a preview of the fundamental ideas of 
vector spaces and linear transformations which you will meet again in Units 
1 and 2. It is of crucial importance for all of your subsequent work on this 
course that you should acquire a firm understanding of these central 
concepts: such understanding is best assured by prolonged exposure to the 
new ideas. 


Consider the following problems.

(a) Solve

$$2x + 3y = 8$$
$$x + 4y = 9.$$

(b) Find the point in the plane whose image is (0, 1) when rotated about the
origin through the angle $\pi/3$.

(c) Find the general solution of the differential equation

$$\dot{q}(t) = I - \lambda q(t).$$


These three seemingly unrelated problems can be solved by methods 
discussed in the Foundation Course, M101. What they have in common is 
an element of linearity which will become apparent as this course unfolds. 


The notion of linearity arises essentially from geometric considerations. In 
the three-dimensional world, the most common mode of simplifying a 
problem is to restrict consideration to a plane or a line. These subsets of 
space possess a property which we recognize by its lack of curvature; it is
this property that we shall seek to characterize. Once we have done that, we 
shall find that there are many sets of ‘objects’ in mathematics which can, by 
a judicious choice of viewpoint, be deemed to possess the property of 
linearity. 


Our aim is to find, for suitable sets, methods by which the ‘objects’ can be 
added to each other and scaled by real numbers. It turns out that these two 
fundamental notions encapsulate what we mean by linearity. 




0.1 LINEARITY IN THE PLANE 
0.1.1 The Set $R^2$


The algebraic view of geometry, which has prevailed in much of the 
Foundation Course, requires that we abandon geometric intuition in 
favour of the interpretation of the symbols of algebra. Thus a point in the
plane is represented by a number pair, say $(x, y)$, which implicitly refers to a
predetermined choice of coordinate axes and a unit of measurement. A
curve is considered as a subset of the plane consisting of all the points whose
coordinates satisfy some equation, such as

$$\{(x, y) : x^2 + y^2 = 1\}$$

or

$$\{(x, y) : 4x + 3y = 2\}.$$

The first of these is a circle and the second a straight line, but how can we tell
without actually sketching the curves?


Let us simplify matters by considering only lines through the origin. 


[Figure: the line through the origin and the point (1, 2).]

The line through the origin which also contains (1, 2) consists of all points
whose coordinates are of the form $(\lambda, 2\lambda)$ for some $\lambda \in R$.

The equation of this line is easily seen to be

$$\{(x, y) : y - 2x = 0\}.$$


From the algebraic viewpoint, the distinguishing characteristic of a straight
line is that its equation can involve only the following two operations:

multiplication of a coordinate by a real number (or scaling),
addition.

This infallible rule tells us immediately that

$$\frac{x^2}{a^2} + \frac{y^2}{b^2} = 1$$

cannot be a straight line, on account of the $x^2$ and $y^2$, even if we have never
heard of an ellipse.


Our aim now is to extend the arithmetic operations of scaling and addition
to other sets. We might as well start with the whole plane, or rather the set
$R^2$, which is defined as the set of all real-number pairs:

$$R^2 = \{(x, y) : x \in R \text{ and } y \in R\}.$$

How shall we define addition and scaling on $R^2$? Well, we have already had
cause to consider the line passing through the origin and $(x, y)$, which
consists of points whose coordinates are $(\lambda x, \lambda y)$, and we shall define scalar
multiplication of a pair by $\lambda$ according to the rule

$$\lambda(x, y) = (\lambda x, \lambda y).$$


In this course we do not use the open
typeface for the standard sets. Thus the
set of reals is denoted by $R$ rather than
$\mathbb{R}$.




Thus the real number $\lambda$ scales the pair $(x, y)$ in $R^2$ by multiplying each
coordinate separately. Likewise, if we have two pairs $(x_1, y_1)$ and $(x_2, y_2)$ we
define addition of pairs according to the rule

$$(x_1, y_1) + (x_2, y_2) = (x_1 + x_2, y_1 + y_2).$$

Note that although we use the same plus symbol on both sides of this
defining equation, its meaning on the right-hand side is unambiguously the
familiar addition of reals which we know so well, whereas on the left-hand
side the + serves as a newly defined operator on members of the set $R^2$.
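
The two rules just defined can be tried out numerically. The following is a minimal sketch, assuming pairs are held as Python tuples; the function names `add` and `scale` are illustrative choices and do not come from the course text.

```python
from typing import Tuple

Pair = Tuple[float, float]


def add(p: Pair, q: Pair) -> Pair:
    """Add two pairs coordinate by coordinate."""
    return (p[0] + q[0], p[1] + q[1])


def scale(lam: float, p: Pair) -> Pair:
    """Multiply each coordinate of the pair p by the real number lam."""
    return (lam * p[0], lam * p[1])


print(add((2, 5), (1, -3)))   # (3, 2)
print(scale(4, (2, -1)))      # (8, -4)
```

Note that the `+` inside `add` is ordinary addition of reals, while `add` itself plays the role of the newly defined operation on pairs.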


Exercises

1. Evaluate the following:

(i) $(1, 0) + (3, -3)$
(ii) $(5, -1) + (-1, 13)$
(iii) $(a, b) + (a, 0)$
(iv) $(c, d) + (-c, -d)$

2. Evaluate the following:

(i) $3(1, 7)$
(ii) $45(0, 0)$
(iii) $1(a, b)$

3. Evaluate the following:

(i) $-1(3, 2) + (-3, 2)$
(ii) $7(1, -1) + 0(2, 2)$
(iii) $4(-1, 4) + 8(6, -2)$
(iv) $\lambda(1, 0) + \mu(0, 1)$

Solutions

1. (i) $(1, 0) + (3, -3) = (1 + 3, 0 - 3) = (4, -3)$.
   (ii) $(4, 12)$. (iii) $(2a, b)$. (iv) $(0, 0)$.
2. (i) $(3, 21)$. (ii) $(0, 0)$. (iii) $(a, b)$.
3. (i) $(-6, 0)$. (ii) $(7, -7)$. (iii) $(44, 0)$. (iv) $(\lambda, \mu)$.


0.1.2 Linear Transformations on the Plane 


The operations of addition and scalar multiplication endow $R^2$ with some
structure. We could now proceed with the investigation of this structure (as
indeed we shall do later on), but first it will not come amiss if we satisfy
ourselves that we are going to achieve something by imposing a
mathematical structure on $R^2$, over and above the ability to play little
arithmetical games with number pairs.

Suppose $T$ denotes the transformation $R^2 \to R^2$ which rotates each point
of $R^2$ through an angle $\theta$ anticlockwise round the origin. It may be shown
that

$$T : (u, v) \mapsto (u\cos\theta - v\sin\theta,\ u\sin\theta + v\cos\theta).$$

Now, geometrically, it is clear that a rotation of $R^2$ maps straight lines to
straight lines; in other words, it preserves the linear structure of the plane.
Algebraically, the idea of a function preserving structure is expressed by the
image of a combination being the corresponding combination of the
images.


Compare this with addition of complex numbers, defined in M101 Block
VI Unit 1, Section 1.2.

M101 Block IV Unit 3, Section 3.4.

M101 Block VI Unit 4.

Exercises

1. What are the images of $(1, 0)$ and $(0, 1)$ under $T$?
2. What is the image of $(3, -2)$ under $T$?
3. Express $(3, -2)$ as a combination of $(1, 0)$ and $(0, 1)$ using the operations
of addition and scalar multiplication.

In this course the solutions will be printed immediately following the
exercises. It would be advisable to keep a size A5 card handy, to cover the
solution while you are working an exercise.

Solutions

1. $T(1, 0) = (\cos\theta, \sin\theta)$; $T(0, 1) = (-\sin\theta, \cos\theta)$.
2. $T(3, -2) = (3\cos\theta + 2\sin\theta,\ 3\sin\theta - 2\cos\theta)$.
3. $(3, -2) = 3(1, 0) - 2(0, 1)$.


We have just seen that $(3, -2)$ can be expressed as the combination

$$3(1, 0) - 2(0, 1).$$

Because the operations of addition and scalar multiplication used to obtain
this combination are linear operations, we say that $(3, -2)$ is hereby
expressed as a linear combination of $(1, 0)$ and $(0, 1)$. The image of this linear
combination is given by

$$\begin{aligned}
T(3, -2) &= (3\cos\theta + 2\sin\theta,\ 3\sin\theta - 2\cos\theta) \\
&= 3(\cos\theta, \sin\theta) - 2(-\sin\theta, \cos\theta) \\
&= 3T(1, 0) - 2T(0, 1).
\end{aligned}$$

In other words, the image of $(3, -2)$ is the same linear combination of the
images $(\cos\theta, \sin\theta)$ and $(-\sin\theta, \cos\theta)$ as $(3, -2)$ is of $(1, 0)$ and $(0, 1)$. In fact,
the specific numbers 3, $-2$ were not intrinsic to our argument and, in a like
way, we could have shown that for any number pair $(\lambda, \mu) \in R^2$,

$$(\lambda, \mu) = \lambda(1, 0) + \mu(0, 1)$$

and

$$T(\lambda, \mu) = \lambda T(1, 0) + \mu T(0, 1).$$

This is essentially the map-reference property of M101 Block IV Unit 4,
Section 4.3.

So the rotation $T$ preserves the linear structure of $R^2$. This result says,
effectively, that provided we know which elements of $R^2$ are the images of
$(1, 0)$ and $(0, 1)$ we can determine the image of an arbitrary element $(\lambda, \mu)$ as
the appropriate linear combination of those images.
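
This property can be checked numerically. The sketch below is only an illustration, not a proof: it assumes pairs are stored as Python tuples, and the helper names (`rotate`, `image_from_basis`, `close`) are hypothetical. It compares the rotation formula applied directly to $(\lambda, \mu)$ with the image built from $T(1, 0)$ and $T(0, 1)$ alone.

```python
import math


def add(p, q):
    """Addition of pairs, coordinate by coordinate."""
    return (p[0] + q[0], p[1] + q[1])


def scale(lam, p):
    """Scalar multiplication of the pair p by lam."""
    return (lam * p[0], lam * p[1])


def rotate(theta, p):
    """Image of the pair p under anticlockwise rotation through theta."""
    u, v = p
    return (u * math.cos(theta) - v * math.sin(theta),
            u * math.sin(theta) + v * math.cos(theta))


def image_from_basis(theta, lam, mu):
    """Build T(lam, mu) using only T(1, 0) and T(0, 1)."""
    return add(scale(lam, rotate(theta, (1, 0))),
               scale(mu, rotate(theta, (0, 1))))


def close(p, q, tol=1e-12):
    """Compare two pairs up to floating-point rounding."""
    return abs(p[0] - q[0]) < tol and abs(p[1] - q[1]) < tol


theta = math.pi / 3
for lam, mu in [(3, -2), (0.5, 4), (-1, -1)]:
    direct = rotate(theta, (lam, mu))
    rebuilt = image_from_basis(theta, lam, mu)
    print((lam, mu), close(direct, rebuilt))
```

The comparison uses a small tolerance because the trigonometric values are floating-point approximations.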


We can develop this further more easily if we introduce the notation

$$i = (1, 0), \qquad j = (0, 1)$$

to avoid having to write these number pairs in full each time. (The equals
sign in these formulas means, e.g., that $i$ is another name for $(1, 0)$.) A typical
element of $R^2$ is

$$(\lambda, \mu) = \lambda i + \mu j.$$

The images of $i$ and $j$ under $T$ are

$$T(i) = (\cos\theta, \sin\theta) = \cos\theta\, i + \sin\theta\, j$$

and

$$T(j) = (-\sin\theta, \cos\theta) = -\sin\theta\, i + \cos\theta\, j.$$

The linear nature of rotation manifests itself in the formula

$$T(\lambda i + \mu j) = \lambda T(i) + \mu T(j).$$


In the Foundation Course we observed this very result by a somewhat 
different means, using matrices. In fact, the matrix approach is intimately 
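connected with this formula. We write a column matrix instead of using $i$ and
$j$, as follows:

$\begin{bmatrix} \lambda \\ \mu \end{bmatrix}$ represents $(\lambda, \mu)$,

$\begin{bmatrix} \cos\theta \\ \sin\theta \end{bmatrix}$ represents $T(i)$,

$\begin{bmatrix} -\sin\theta \\ \cos\theta \end{bmatrix}$ represents $T(j)$.

The rule for matrix multiplication gives us

$$\begin{bmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{bmatrix}
\begin{bmatrix} \lambda \\ \mu \end{bmatrix}
= \begin{bmatrix} \lambda\cos\theta - \mu\sin\theta \\ \lambda\sin\theta + \mu\cos\theta \end{bmatrix}$$

where the given matrix represents our rotation $T$. It is no accident that the
columns of the $2 \times 2$ matrix representing $T$ are identical with the $2 \times 1$
column matrices representing $T(i)$ and $T(j)$, as you can see in the next
exercise.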
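
The remark about columns can be illustrated with a short sketch. It assumes a matrix is stored as a list of two rows and a column matrix as a list of two numbers; `rotation_matrix` and `mat_vec` are hypothetical helper names, not notation from the course.

```python
import math


def rotation_matrix(theta):
    """The 2 x 2 matrix representing rotation through theta, as a list of rows."""
    return [[math.cos(theta), -math.sin(theta)],
            [math.sin(theta), math.cos(theta)]]


def mat_vec(m, v):
    """Product of a 2 x 2 matrix m with a 2 x 1 column matrix v."""
    return [m[0][0] * v[0] + m[0][1] * v[1],
            m[1][0] * v[0] + m[1][1] * v[1]]


m = rotation_matrix(0.7)
first_column = [m[0][0], m[1][0]]
second_column = [m[0][1], m[1][1]]

# Multiplying by the column for i picks out the first column of the matrix,
# and multiplying by the column for j picks out the second.
print(mat_vec(m, [1, 0]) == first_column)
print(mat_vec(m, [0, 1]) == second_column)
```

The angle 0.7 is arbitrary; any other value of $\theta$ would serve equally well.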
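Exercise

Multiply out

(i) $\begin{bmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{bmatrix} \begin{bmatrix} 1 \\ 0 \end{bmatrix}$

(ii) $\begin{bmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{bmatrix} \begin{bmatrix} 0 \\ 1 \end{bmatrix}$

Solution

(i) $\begin{bmatrix} \cos\theta \\ \sin\theta \end{bmatrix}$

(ii) $\begin{bmatrix} -\sin\theta \\ \cos\theta \end{bmatrix}$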


0.1.3 Summary of Section 0.1 


In this section we have defined the terms
addition (of pairs)
scalar multiplication (of pairs).
We introduced the notation
$R^2$.


You are not expected to remember every detail of this section; we have 
attempted here only to take an example from within your experience to set 
the stage for the study ahead of us. This study will involve us in constructing 
the linear operations of addition and scalar multiplication on certain 
suitable sets and considering the properties of transformations between 
such sets which respect this linearity. 


0.2 GEOMETRIC VECTORS 
0.2.0 Introduction 


The speed and direction of an aeroplane over the ground depend not only 
upon the thrust of its engines but also upon the strength and direction of the
wind. A cross-wind will blow the aircraft off course unless the pilot heads 
slightly into the wind to compensate. This is a basic problem of navigation. 


To construct a mathematical model of such a situation, we represent the 
wind’s effect by a directed line segment or arrow pointing in the direction of 
the wind and of length proportional to the wind speed. The velocity of the 
aircraft through the air is similarly represented by another arrow. 


The navigational problem of determining the aircraft’s course over the 
ground then becomes a mathematical problem: how to combine the arrows. 


Similarly the effect of a force may be modelled by an arrow, whose length
and direction correspond to those of the force. The resultant of two forces
may be determined from the parallelogram of forces, which is a geometric
method of combining the two corresponding arrows.

M101 Block V Unit 4, Section 4.3.




0.2.1 Translations 


We shall, by looking at translations of the plane, obtain a convenient means 
of discussing the mathematical properties of physical quantities such as 
velocity and force which can be modelled geometrically by arrows. 


A translation of the plane moves each point through a fixed distance in a 
fixed direction. To picture a translation, we might indicate its effect on 
several points of the plane. 


Each arrow here has the same length and the same direction; and each 
indicates the effect of the same translation on the point at which its tail lies, 
moving it to the position occupied by its head. (If we regard each arrow as 
indicating the motion of a particle of air over a fixed time-interval, then 
what we have here is a constant wind pattern.)


A translation, then, is represented by a set of arrows all sharing the same 
length and the same direction. 


Another way of looking at this is to define a relation between arrows, such 
that two arrows are related if and only if they have the same length and the 
same direction. 


This is easily seen to be an equivalence relation on the set of all arrows in the 
plane, but we shall not stop to prove it here. 


Under this relation, each equivalence class is called a geometric vector.
Such a geometric vector is a set of arrows: the set of all arrows sharing one
particular length and one particular direction. It therefore corresponds
exactly to a particular translation of the plane. Strictly speaking, a
translation is represented by a geometric vector, but the two concepts
share the same mathematical properties and we shall regard them as
interchangeable.


Clearly a geometric vector is uniquely determined by specifying any one of 
the arrows belonging to it. 


We shall write $\underline{AB}$ to denote the geometric vector containing the arrow
from A to B, but note that $\underline{AB}$ is not fixed in the position AB. This is just
another way of saying that $\underline{AB}$ represents the translation of the plane which
transforms A to B.

[Figure: two arrows, one from A to B and one from C to D.]

The two arrows shown in the diagram above belong to the same geometric
vector, so we may write

$$\underline{AB} = \underline{CD}.$$


Translations were discussed in M101, 
Block I Unit 3 and Block IV Unit 3. 


A discussion of equivalence relations 
can be found in M101 Block IV Unit 2, 
Section 2.3. 




Exercise 


[Figure: a regular hexagon ABCDEF with centre O.]

The above figure is a regular hexagon. We have omitted the arrow heads as
these are implied in the following statements. In each case indicate if the
statement is true or false:

(i) $\underline{AB} = \underline{ED}$ TRUE/FALSE
(ii) $\underline{FO} = \underline{ED}$ TRUE/FALSE
(iii) $\underline{AO} = \underline{EF}$ TRUE/FALSE
(iv) $\underline{BC} = \underline{AD}$ TRUE/FALSE

Solution

(i) TRUE. The same translation which takes A to B takes E to
D, so both arrows represent (belong to) the same geometric
vector.

(ii) TRUE. Both the arrows FO and ED belong to the same
geometric vector.

(iii) FALSE. The arrows AO and EF (representatives of $\underline{AO}$ and
$\underline{EF}$ respectively) are in opposite directions.

(iv) FALSE. The arrows BC and AD have different lengths.


It is convenient to work with examples in the plane, but we could equally 
well have geometric vectors in three-dimensional space. If t denotes a 
translation of three-dimensional space, then to each point P there is a 
unique point Q such that $t$ is represented by $\underline{PQ}$.

[Figure: the arrow from P to Q = t(P).]
0.2.2 Addition of Geometric Vectors 


We compose two translations by first performing one translation and then
performing the other. The composition of translations induces a binary
operation on the set of geometric vectors, and we call this operation
addition. When two translations, $s$ and $t$ say, are composed, the result is
another translation.

The best way to see this is to consider the effect of the composition $t \circ s$ of the
two translations $s$ and $t$ on a general point P in the plane. We can represent $s$
by an arrow with its tail at P, and $t$ by an arrow with its tail at $s(P)$. The
effect on P of $s$, then $t$ (that is, $t \circ s$) is then represented by the arrow from
P to $t(s(P))$. Likewise, if we consider the effect of $t \circ s$ on any other point Q
then we obtain a triangle of arrows whose sides are parallel and of equal
length to the corresponding sides in the triangle obtained at P. In
particular, any point Q is carried by $t \circ s$ through the same distance and in
the same direction as P; that is, $t \circ s$ is a translation.

For convenience we shall sometimes put the arrowhead in the middle of an
arrow.


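[Figure: two triangles of arrows, one from P through s(P) to t(s(P)) and one from Q through s(Q) to t(s(Q)).]

If we now call the associated geometric vectors $\underline{s}$ and $\underline{t}$, then the triangle
of arrows defines a resultant geometric vector $\underline{u}$, say. We write

$$\underline{s} + \underline{t} = \underline{u}$$

and call $\underline{u}$ the resultant of $\underline{s}$ and $\underline{t}$. The rule for addition of geometric vectors
is uniquely defined by the composition of translations.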


The choice of the symbol + to denote the binary operation which yields the 
resultant of two geometric vectors is not arbitrary: the properties of 
addition of real numbers are also properties of the addition of geometric 
vectors. Two such properties are considered in the exercises which follow. 
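
The link between composing translations and adding geometric vectors can be sketched in code. This is an illustration under an assumption the text has not yet made at this point: each translation is described by a coordinate displacement (dx, dy), anticipating the coordinate description introduced later in the unit. The names `translation` and `compose` are hypothetical.

```python
def translation(dx, dy):
    """Return the translation of the plane that moves every point by (dx, dy)."""
    def move(point):
        x, y = point
        return (x + dx, y + dy)
    return move


def compose(t, s):
    """The composition 't after s': first perform s, then t."""
    return lambda point: t(s(point))


s = translation(2, 1)
t = translation(-1, 3)
u = translation(2 + (-1), 1 + 3)   # displacement obtained by adding components

for p in [(0, 0), (5, -2), (1, 1)]:
    # Every point is carried to the same place by 't after s', by u,
    # and by 's after t'.
    print(p, compose(t, s)(p), u(p), compose(s, t)(p))
```

Every test point is moved identically by the composite and by the single translation `u`, and the order of composition makes no difference, which is the behaviour the exercises below ask you to confirm geometrically.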


Exercises 


1. Draw a diagram to illustrate the geometric vectors $\underline{a} + \underline{b}$ and $\underline{b} + \underline{a}$.
Are the two geometric vectors equal? Is addition of geometric vectors
commutative?

2. Use the following diagram to illustrate the associative property of
addition of geometric vectors:

$$(\underline{a} + \underline{b}) + \underline{c} = \underline{a} + (\underline{b} + \underline{c}).$$

Solutions

1. It is true that $\underline{a} + \underline{b} = \underline{b} + \underline{a}$, and therefore addition is
commutative. This diagram shows an alternative way of
describing the addition of two geometric vectors: if $\underline{a}$ and $\underline{b}$ are
represented by two arrows with their tails at the same point
then, on completion of the parallelogram, the sum of the two
geometric vectors is represented by the diagonal arrow.

Compare with the parallelogram of forces in M101 Block V Unit 4.


To continue the analogy we would like to define a geometric vector having
the property of the number zero for addition in $R$. A translation through
zero distance in any direction carries each point of the plane to itself, that is,
it is the identity transformation. Composition of any other translation $t$
with the identity transformation has the same effect as $t$ itself; the
corresponding result for geometric vectors is

$$\underline{t} + \underline{0} = \underline{t} = \underline{0} + \underline{t},$$

where $\underline{0}$ is the geometric vector whose arrows have zero length (and
arbitrary direction). We may call $\underline{0}$ the zero or null geometric vector.


Associated with any translation $t$ we can perform the inverse translation
which carries each point $t(P)$ back to P. The inverse of $t$ is a translation in
the direction opposite to that of $t$ and through the same distance as $t$. If $\underline{t}$ is
the geometric vector corresponding to the translation $t$, we write $-\underline{t}$ for the
geometric vector corresponding to the inverse translation.

It should be clear that

$$\underline{t} + (-\underline{t}) = \underline{0}.$$


In line with the spirit of M101 Block VI we now present a summary of the 
additive properties of geometric vectors. 


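A1 $\underline{a} + \underline{b}$ is a geometric vector. (+ is CLOSED)
A2 $\underline{a} + (\underline{b} + \underline{c}) = (\underline{a} + \underline{b}) + \underline{c}$ (+ is ASSOCIATIVE)
A3 There is a geometric vector, denoted $\underline{0}$, such that for all $\underline{a}$,
   $\underline{a} + \underline{0} = \underline{a}$. (IDENTITY ELEMENT for +)
A4 For each geometric vector $\underline{a}$ there exists a (unique) geometric vector
   $-\underline{a}$, such that $\underline{a} + (-\underline{a}) = \underline{0}$. (existence of INVERSES)
A5 $\underline{a} + \underline{b} = \underline{b} + \underline{a}$ (+ is COMMUTATIVE)

We now recognize that the set of geometric vectors, under addition, has the
structure of a commutative group.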


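M101 Block VI Unit 2.

It is sometimes convenient to have a notation for subtraction of vectors: we
define

$$\underline{a} - \underline{b} = \underline{a} + (-\underline{b}).$$

Here the equals sign means that $\underline{a} - \underline{b}$ is another way of writing $\underline{a} + (-\underline{b})$.

The example which follows illustrates a method of manipulating equations
involving geometric vectors.

Example

From the definitions, prove that

$$(\underline{a} - \underline{b} = \underline{c}) \Rightarrow (\underline{a} = \underline{c} + \underline{b}).$$

The symbol $\Rightarrow$ is read "implies" and means that if the left-hand statement is
true then the right-hand statement must also be true.

Solution

$$\begin{aligned}
(\underline{a} - \underline{b} = \underline{c}) &\Rightarrow \underline{a} + (-\underline{b}) = \underline{c} && \text{(definition of subtraction)} \\
&\Rightarrow (\underline{a} + (-\underline{b})) + \underline{b} = \underline{c} + \underline{b} \\
&\Rightarrow \underline{a} + ((-\underline{b}) + \underline{b}) = \underline{c} + \underline{b} && \text{(associativity of +)} \\
&\Rightarrow \underline{a} + \underline{0} = \underline{c} + \underline{b} && \text{(definition of } -\underline{b}\text{)} \\
&\Rightarrow \underline{a} = \underline{c} + \underline{b}. && \text{(property of } \underline{0}\text{)}
\end{aligned}$$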


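In particular, if two geometric vectors $\underline{a}$ and $\underline{b}$ satisfy

$$\underline{a} - \underline{b} = \underline{0}$$

then $\underline{a}$ and $\underline{b}$ are equal (that is, they represent the same translation).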


0.2.3 Scalar Multiples of Geometric Vectors 


Any geometric vector has an associated length and direction. Let us
consider for a moment the geometric vectors sharing one particular
direction. If we compose a translation with itself we obtain another
translation in the same direction. We may ask:

what is the length of $\underline{a} + \underline{a}$ in terms of $\underline{a}$?

The translation corresponding to $\underline{a} + \underline{a}$ carries points through twice the
length of $\underline{a}$ and in the same direction. This suggests that we write $2\underline{a}$ to mean
$\underline{a} + \underline{a}$, the geometric vector having the same direction as $\underline{a}$ but twice the length.
We can generalize this idea to define $\lambda\underline{a}$, for any real $\lambda > 0$, to mean the
geometric vector corresponding to a translation through a distance $\lambda$ times
that of $\underline{a}$ and in the same direction. When $\lambda = 0$ we obtain the geometric
vector corresponding to zero distance, that is,

$$0\underline{a} = \underline{0}.$$

When $\lambda < 0$ we may define $\lambda\underline{a}$ to be the geometric vector corresponding to the
translation through the (positive) multiple $-\lambda$ of the distance of $\underline{a}$, in the
direction opposite to that of $\underline{a}$. This ties in well with the
definition of $-\underline{a}$; indeed

$$(-1)\underline{a} = -\underline{a}.$$

This completes the definition of our second operation on geometric vectors,
which we call scalar multiplication. We say that $\underline{a}$ has been scaled by $\lambda$ or
multiplied by the scalar $\lambda$. Unlike addition of geometric vectors, scalar
multiplication is defined not between two geometric vectors but between a
scalar (or real number) $\lambda$ and a geometric vector $\underline{a}$. Throughout this course
we shall be considering sets of "vectors" which can be "multiplied" or
"scaled" by real numbers; in such a context we shall refer to the real
numbers as scalars.


Note that we have adopted the usual algebraic convention of representing 
multiplication by juxtaposing the scalar and the vector, avoiding the need 
for a multiplication symbol. Scalar multiplication of geometric vectors has 
the following properties. 


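B1 $\lambda\underline{a}$ is a geometric vector for each $\lambda \in R$.
B2 $\lambda(\mu\underline{a}) = (\lambda\mu)\underline{a}$
B3 $\lambda(\underline{a} + \underline{b}) = \lambda\underline{a} + \lambda\underline{b}$
B4 $(\lambda + \mu)\underline{a} = \lambda\underline{a} + \mu\underline{a}$
B5 $1\underline{a} = \underline{a}$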
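
Properties B2 to B5 can be spot-checked numerically. The sketch below is not a proof: it assumes each geometric vector is described by a pair of integer components (anticipating the coordinate description given later in the unit), and the names `add` and `scale` and the sample values are illustrative only.

```python
def add(a, b):
    """Sum of two vectors given by their components."""
    return (a[0] + b[0], a[1] + b[1])


def scale(lam, a):
    """Scalar multiple of the vector a by lam."""
    return (lam * a[0], lam * a[1])


a, b = (2, -1), (4, 3)
lam, mu = 3, -2

checks = {
    "B2": scale(lam, scale(mu, a)) == scale(lam * mu, a),
    "B3": scale(lam, add(a, b)) == add(scale(lam, a), scale(lam, b)),
    "B4": scale(lam + mu, a) == add(scale(lam, a), scale(mu, a)),
    "B5": scale(1, a) == a,
}
print(checks)
```

Integer components are used so that the comparisons are exact; with floating-point components a small tolerance would be needed.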


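These properties follow from the definition of scalar multiplication; we
shall demonstrate the proof for B3.

If $\lambda = 0$, there is not much to demonstrate. Suppose $\lambda > 0$; then we have the
following diagram, where $\underline{c} = \lambda\underline{a} + \lambda\underline{b}$. We want to show that $\underline{c} = \lambda(\underline{a} + \underline{b})$.

[Figure: triangle ABC, with sides representing $\underline{a}$, $\underline{b}$ and $\underline{a} + \underline{b}$, and triangle XYZ, with sides representing $\lambda\underline{a}$, $\lambda\underline{b}$ and $\underline{c}$.]

Since $\dfrac{XY}{AB} = \dfrac{YZ}{BC} = \lambda$ and angle ABC = angle XYZ, the two triangles are
similar. Therefore

$$\frac{XZ}{AC} = \lambda.$$

Further, XZ is parallel to AC, and therefore $\underline{XZ} = \lambda\underline{AC}$, i.e.

$$\underline{c} = \lambda(\underline{a} + \underline{b}).$$

If $\lambda < 0$, we have a similar argument.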


0.2.4 Linear Dependence and Independence 


Consider an ordinary (rectangular Cartesian) coordinate system in the 
plane. 


The translation taking the origin (0,0) into the point (5,3), say, can be 
accomplished by composing two translations, one taking the origin to the 
point (5,0), the other taking the origin to the point (0,3). These are 
translations parallel to the x-axis and y-axis respectively. 


Let the corresponding geometric vectors be a and b respectively, and let i 
and j be “unit” geometric vectors parallel to the axes as shown in the 
diagram. 


Since $\underline{a}$ and $\underline{i}$ share the same direction, they differ only in length. Since the
distance from the origin to (5, 0) is five times the distance to (1, 0), the
corresponding translations are related in the way which we used to define
scalar multiplication, and so $\underline{a} = 5\underline{i}$. Similarly we can see that $\underline{b} = 3\underline{j}$.
Therefore

$$\underline{a} + \underline{b} = 5\underline{i} + 3\underline{j},$$

and this corresponds to the translation which takes the origin to the point
(5, 3).


For the present, it is sufficient that you should have an intuitive grasp of all 
this. You can probably see now that the translation which takes the origin 
to the point (x, y) corresponds to the geometric vector xi + yj. There is thus 
a unique correspondence between geometric vectors and points in a 
Cartesian coordinate system. 


Moreover, any geometric vector in the plane can be expressed in the form
$\lambda\underline{i} + \mu\underline{j}$, where $(\lambda, \mu)$ is the image of the origin under the corresponding
translation. An expression such as $\lambda\underline{i} + \mu\underline{j}$ is called a linear combination of
$\underline{i}$ and $\underline{j}$.


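In general, given geometric vectors $\underline{a}_1, \ldots, \underline{a}_m$, we can consider linear
combinations of the form

$$\lambda_1\underline{a}_1 + \cdots + \lambda_m\underline{a}_m$$

for scalars $\lambda_1, \ldots, \lambda_m$.

If we are given a set $\{\underline{a}_1, \ldots, \underline{a}_m\}$ with the property that every geometric
vector in the plane can be expressed as such a linear combination, then the
set $\{\underline{a}_1, \ldots, \underline{a}_m\}$ spans the set of all geometric vectors in the plane.

Thus, $\{\underline{i}, \underline{j}\}$ spans the set of (planar) geometric vectors, since each geometric
vector can be expressed as a linear combination

$$\lambda\underline{i} + \mu\underline{j}.$$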


In the examples and exercises which follow, we shall consider the geometric 
vectors given by: 


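$$\underline{a} = \underline{i} + 2\underline{j}$$
$$\underline{b} = \underline{i} + \underline{j}$$
$$\underline{c} = 2\underline{i}$$
$$\underline{d} = 2\underline{i} + 2\underline{j}$$
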
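Example 1

Does the set $\{\underline{a}, \underline{b}, \underline{c}\}$ span the set of planar geometric vectors?

Solution

If $\underline{v}$ is a geometric vector in the plane we know that we can find scalars
$\lambda, \mu \in R$ such that

$$\underline{v} = \lambda\underline{i} + \mu\underline{j}.$$

We note that

$$\underline{a} + \underline{b} - \underline{c} = 3\underline{j}$$

and

$$\underline{c} = 2\underline{i},$$

so that

$$\underline{j} = \tfrac{1}{3}(\underline{a} + \underline{b} - \underline{c})$$

and

$$\underline{i} = \tfrac{1}{2}\underline{c}.$$

Hence

$$\begin{aligned}
\underline{v} &= \lambda\underline{i} + \mu\underline{j} \\
&= \frac{\lambda}{2}\underline{c} + \frac{\mu}{3}(\underline{a} + \underline{b} - \underline{c}) \\
&= \frac{\mu}{3}\underline{a} + \frac{\mu}{3}\underline{b} + \left(\frac{\lambda}{2} - \frac{\mu}{3}\right)\underline{c};
\end{aligned}$$

that is, any $\underline{v}$ can be expressed as a linear combination of $\underline{a}$, $\underline{b}$ and $\underline{c}$. In other
words, $\{\underline{a}, \underline{b}, \underline{c}\}$ spans the set of geometric vectors.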
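
The coefficients found in this example, namely $\mu/3$ for $\underline{a}$, $\mu/3$ for $\underline{b}$ and $\lambda/2 - \mu/3$ for $\underline{c}$, can be checked with exact arithmetic. The sketch assumes each geometric vector is stored by its components relative to $\underline{i}$ and $\underline{j}$, so that $\underline{a} = (1, 2)$, $\underline{b} = (1, 1)$ and $\underline{c} = (2, 0)$; the function names are hypothetical.

```python
from fractions import Fraction

# Components relative to i and j: a = i + 2j, b = i + j, c = 2i.
A, B, C = (1, 2), (1, 1), (2, 0)


def coefficients(lam, mu):
    """Coefficients of a, b, c that rebuild lam*i + mu*j."""
    lam, mu = Fraction(lam), Fraction(mu)
    return (mu / 3, mu / 3, lam / 2 - mu / 3)


def combine(coeffs, vectors):
    """Form the linear combination of the vectors with the given coefficients."""
    x = sum(k * v[0] for k, v in zip(coeffs, vectors))
    y = sum(k * v[1] for k, v in zip(coeffs, vectors))
    return (x, y)


coeffs = coefficients(5, 3)
print(coeffs)
print(combine(coeffs, (A, B, C)))
```

`Fraction` is used so that thirds and halves are represented exactly rather than as rounded decimals.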
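Exercises

1. Express $3\underline{i} + 2\underline{j}$ as a linear combination of $\underline{a}$, $\underline{b}$ and $\underline{c}$.
2. Show that $\{\underline{b}, \underline{i}\}$ spans the set of geometric vectors.
3. Show that $\{\underline{b}, \underline{d}\}$ does not span the set of all geometric vectors.
Which subset of geometric vectors is spanned by $\{\underline{b}, \underline{d}\}$?

Solutions

1. Using the formula obtained in the example, we obtain

$$3\underline{i} + 2\underline{j} = \tfrac{2}{3}\underline{a} + \tfrac{2}{3}\underline{b} + \tfrac{5}{6}\underline{c}.$$

2. Since $\underline{b} = \underline{i} + \underline{j}$, we have

$$\lambda\underline{i} + \mu\underline{j} = \mu\underline{b} + (\lambda - \mu)\underline{i}.$$

This corresponds to an oblique coordinate system, as shown in
the figure.

3. It is sufficient to find ONE geometric vector which cannot be
expressed as a linear combination $\alpha\underline{b} + \beta\underline{d}$. Now

$$\alpha\underline{b} + \beta\underline{d} = (\alpha + 2\beta)(\underline{i} + \underline{j}),$$

so it is clear that no choice of $\alpha, \beta \in R$ will give

$$\alpha\underline{b} + \beta\underline{d} = \underline{i},$$

for example.

In fact,

$$\begin{aligned}
\alpha\underline{b} + \beta\underline{d} &= \alpha\underline{b} + 2\beta\underline{b} \\
&= (\alpha + 2\beta)\underline{b}
\end{aligned}$$

(since $\underline{d} = 2\underline{b}$). So $\{\underline{b}, \underline{d}\}$ spans only the subset of geometric vectors parallel to
$\underline{b}$ (i.e. the scalar multiples of $\underline{b}$).

The equation $\underline{d} = 2\underline{b}$ can be rearranged to

$$2\underline{b} - \underline{d} = \underline{0},$$

which expresses $\underline{0}$ as a (nontrivial) linear combination of $\underline{b}$ and $\underline{d}$.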


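Whenever there exist scalars $\lambda_1, \ldots, \lambda_m$, not all zero, such that

$$\lambda_1\underline{a}_1 + \cdots + \lambda_m\underline{a}_m = \underline{0}$$

we say that the set of geometric vectors $\{\underline{a}_1, \ldots, \underline{a}_m\}$ is linearly dependent.
Otherwise, if $\lambda_1\underline{a}_1 + \cdots + \lambda_m\underline{a}_m = \underline{0}$ implies that $\lambda_1 = \cdots = \lambda_m = 0$, the set
$\{\underline{a}_1, \ldots, \underline{a}_m\}$ is said to be linearly independent. The simplest example of a
linearly independent set is $\{\underline{a}\}$ where $\underline{a}$ is any geometric vector other than $\underline{0}$.
Another simple example is $\{\underline{i}, \underline{j}\}$, for the only linear combination of $\underline{i}$ and $\underline{j}$
which can yield $\underline{0}$ (corresponding to the zero translation) is $0\underline{i} + 0\underline{j}$.
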
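Example 2

$\{\underline{b}, \underline{d}\}$ is a linearly dependent set, since $\underline{d} - 2\underline{b} = \underline{0}$.

Example 3

$\{\underline{b}, \underline{i}\}$ is a linearly independent set, for if $\alpha\underline{b} + \beta\underline{i} = \underline{0}$ then

$$(\alpha + \beta)\underline{i} + \alpha\underline{j} = \underline{0}.$$

This clearly implies that $\alpha + \beta = 0$ and $\alpha = 0$. So $\alpha = \beta = 0$.
Thus we have shown that $\alpha\underline{b} + \beta\underline{i} = \underline{0}$ can hold only if $\alpha = \beta = 0$.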


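Exercise

4. Show that the geometric vectors $\underline{a}, \underline{b}, \underline{c}$ of Example 1 form a linearly
dependent set.

Solution

4. We must look for scalars $\alpha, \beta, \gamma$ which satisfy the equation

$$\alpha\underline{a} + \beta\underline{b} + \gamma\underline{c} = \underline{0}.$$

In terms of the geometric vectors $\underline{i}$ and $\underline{j}$ we have

$$\alpha(\underline{i} + 2\underline{j}) + \beta(\underline{i} + \underline{j}) + \gamma(2\underline{i}) = \underline{0}$$

$$\Rightarrow (\alpha + \beta + 2\gamma)\underline{i} + (2\alpha + \beta)\underline{j} = \underline{0}.$$

Hence $\alpha + \beta + 2\gamma = 0$ and $2\alpha + \beta = 0$, and these equations
have many non-trivial solutions, such as $\alpha = 2$, $\beta = -4$, $\gamma = 1$.
Thus

$$2\underline{a} - 4\underline{b} + \underline{c} = \underline{0},$$

and this set of three geometric vectors is linearly dependent.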


The following table summarizes the properties of the sets of geometric 
vectors studied in the foregoing examples and exercises. 


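Set                                                    spans    linearly independent
$\{\underline{a}, \underline{b}, \underline{c}\}$      ✓        ×
$\{\underline{b}, \underline{i}\}$                     ✓        ✓
$\{\underline{b}, \underline{d}\}$                     ×        ×
$\{\underline{i}, \underline{j}\}$                     ✓        ✓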
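
The entries of this table can be reproduced mechanically. The following sketch assumes each planar geometric vector is stored by its components relative to $\underline{i}$ and $\underline{j}$, and uses the 2 x 2 determinant as a test for two vectors being non-parallel. This is a convenient computational shortcut, not the method used in the text, and all function names are hypothetical.

```python
from itertools import combinations


def det(u, v):
    """2 x 2 determinant; it is zero exactly when u and v are parallel."""
    return u[0] * v[1] - u[1] * v[0]


def rank(vectors):
    """Largest number of linearly independent vectors in the list (0, 1 or 2)."""
    if any(det(u, v) != 0 for u, v in combinations(vectors, 2)):
        return 2
    if any(v != (0, 0) for v in vectors):
        return 1
    return 0


def spans_plane(vectors):
    """A set spans the planar vectors when it contains two non-parallel vectors."""
    return rank(vectors) == 2


def is_independent(vectors):
    """A set is linearly independent when no vector in it is redundant."""
    return rank(vectors) == len(vectors)


I, J = (1, 0), (0, 1)
A, B, C, D = (1, 2), (1, 1), (2, 0), (2, 2)

sets = {"{a, b, c}": [A, B, C], "{b, i}": [B, I], "{b, d}": [B, D], "{i, j}": [I, J]}
for name, vs in sets.items():
    print(name, spans_plane(vs), is_independent(vs))
```

The test `rank == len(vectors)` relies on the fact that in the plane at most two vectors can be linearly independent, so any set of three or more is automatically dependent.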




0.2.5 Basis Vectors 


We have expended some considerable effort in determining whether given 
subsets of geometric vectors in the plane (a) span all the geometric vectors, 
and (b) are linearly independent. The spanning property is clearly useful:
it tells us whether an arbitrarily chosen geometric vector can be expressed
as a linear combination of the members of our chosen set (which we hope is
small). But how does linear independence help us? The answer is as
follows. If a set spans, but is linearly dependent, then each geometric vector
may be expressed in many different linear combinations of the members of
the chosen set. Linear independence guarantees uniqueness: there is only
one linear combination of a linearly independent set that equals a given
geometric vector.
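
The loss of uniqueness for a dependent spanning set can be seen concretely with the vectors $\underline{a} = \underline{i} + 2\underline{j}$, $\underline{b} = \underline{i} + \underline{j}$, $\underline{c} = 2\underline{i}$ of the previous sub-section, which satisfy $2\underline{a} - 4\underline{b} + \underline{c} = \underline{0}$. In the sketch below (component representation and function names are assumptions made for illustration), adding any multiple of the coefficients (2, -4, 1) to one representation of $3\underline{i} + 2\underline{j}$ gives another representation of the same vector.

```python
from fractions import Fraction

# Components relative to i and j: a = i + 2j, b = i + j, c = 2i.
A, B, C = (1, 2), (1, 1), (2, 0)


def combine(coeffs):
    """The linear combination of a, b, c with the given three coefficients."""
    x = sum(k * v[0] for k, v in zip(coeffs, (A, B, C)))
    y = sum(k * v[1] for k, v in zip(coeffs, (A, B, C)))
    return (x, y)


# One representation of 3i + 2j, and the dependence relation 2a - 4b + c = 0.
base = (Fraction(2, 3), Fraction(2, 3), Fraction(5, 6))
relation = (2, -4, 1)


def shifted(k):
    """Another set of coefficients: base plus k times the dependence relation."""
    return tuple(c + k * r for c, r in zip(base, relation))


for k in range(3):
    print(shifted(k), combine(shifted(k)))
```

Each choice of `k` gives different coefficients but the same vector, which is exactly the non-uniqueness that linear independence rules out.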


When we have a subset of the geometric vectors which is both linearly
independent and spans the whole set, we call the subset a basis (pl. bases) for
the set of geometric vectors. The set $\{\underline{i}, \underline{j}\}$ clearly is a basis, and in fact
constitutes the motivation for considering bases. A basis for the set of
geometric vectors has the property that it can be used to establish a grid for
(oblique) coordinates of the plane. You should check that the set $\{\underline{i}, \underline{b}\}$
discussed previously satisfies the definition of a basis.


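Example

We have seen that the geometric vectors $\underline{a}, \underline{b}, \underline{c}$ are linearly dependent.
However, $\underline{a}$ and $\underline{b}$ are linearly independent, for if $\alpha\underline{a} + \beta\underline{b} = \underline{0}$, then

$$\alpha(\underline{i} + 2\underline{j}) + \beta(\underline{i} + \underline{j}) = \underline{0}.$$

This requires that $\alpha + \beta = 0$ and $2\alpha + \beta = 0$. The only solution of this pair
of equations is $\alpha = \beta = 0$.

Further, $\underline{a}$ and $\underline{b}$ span the set of planar geometric vectors for we have, for
any $\lambda, \mu \in R$,

$$\lambda\underline{i} + \mu\underline{j} = (\mu - \lambda)\underline{a} + (2\lambda - \mu)\underline{b}.$$

Thus $\{\underline{a}, \underline{b}\}$ is a basis for the set of planar geometric vectors.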
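
Finding the coefficients of a vector relative to a basis amounts to solving two linear equations in two unknowns. The sketch below does this with Cramer's rule, which is a standard technique swapped in here for illustration and is not the method used in the text; it assumes integer components relative to $\underline{i}$ and $\underline{j}$, and the name `coordinates` is hypothetical.

```python
from fractions import Fraction


def coordinates(v, a, b):
    """Return (alpha, beta) with alpha*a + beta*b = v, assuming {a, b} is a basis."""
    d = a[0] * b[1] - a[1] * b[0]
    if d == 0:
        raise ValueError("a and b are parallel, so {a, b} is not a basis")
    alpha = Fraction(v[0] * b[1] - v[1] * b[0], d)
    beta = Fraction(a[0] * v[1] - a[1] * v[0], d)
    return alpha, beta


A, B = (1, 2), (1, 1)   # a = i + 2j, b = i + j

# For v = lam*i + mu*j the example gives alpha = mu - lam and beta = 2*lam - mu.
lam, mu = 5, 3
print(coordinates((lam, mu), A, B), (mu - lam, 2 * lam - mu))
```

Because $\{\underline{a}, \underline{b}\}$ is linearly independent the determinant `d` is non-zero, and the pair returned is the only one that works.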


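Exercises

1. If $\underline{a}$ and $\underline{b}$ are linearly dependent, determine whether
(i) $\{3\underline{a}, 4\underline{b}\}$
(ii) $\{\underline{a} + \underline{b}, \underline{a} - \underline{b}\}$
are linearly dependent sets.

2. If $\{\underline{a}, \underline{b}\}$ forms a basis, show that
(i) $\{3\underline{a}, 4\underline{b}\}$
(ii) $\{\underline{a} + \underline{b}, \underline{a} - \underline{b}\}$
form bases.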


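Solutions

1. If $\underline{a}$ and $\underline{b}$ are linearly dependent, then we know that there are
numbers $\alpha$ and $\beta$, not both zero, such that

$$\alpha\underline{a} + \beta\underline{b} = \underline{0}.$$

(i) It follows that

$$\frac{\alpha}{3}(3\underline{a}) + \frac{\beta}{4}(4\underline{b}) = \underline{0}$$

and hence $3\underline{a}$ and $4\underline{b}$ are also linearly dependent.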


This result will be proved formally in 
sub-section 0.3.2. 


(ii) a + b and a − b are linearly dependent if we can find
numbers λ and μ, not both zero, such that

λ(a + b) + μ(a − b) = 0,

i.e. such that

(λ + μ)a + (λ − μ)b = 0.

Since a and b are linearly dependent, there are numbers α
and β, not both zero, such that

αa + βb = 0.

Suppose that we choose λ + μ = α and λ − μ = β, so that

λ = (α + β)/2 and μ = (α − β)/2.

Then, if λ and μ are not both zero, we have shown that
a + b and a − b are linearly dependent. But this follows at
once, since λ = μ = 0 implies α = β = 0, which we know
to be false.


2. (i) Suppose that we can find scalars α and β such that

α(3a) + β(4b) = 0,
i.e. 3α(a) + 4β(b) = 0.

Since {a, b} is linearly independent, it follows that
3α = 4β = 0, which implies that

α = β = 0.

So {3a, 4b} is linearly independent.

There remains to be shown that any geometric vector (in
the plane), v say, can be expressed as a linear combination

v = λ(3a) + μ(4b)

for some real λ, μ. Since {a, b} is a basis we can certainly
find real numbers α, β such that

v = αa + βb
  = (α/3)(3a) + (β/4)(4b),

i.e. λ = α/3 and μ = β/4.

Thus {3a, 4b} satisfies both of the conditions for a basis.


(ii) To show that {a + b, a − b} is a basis, we must show that
it is linearly independent and spans the set of geometric
vectors.

Suppose firstly that α, β ∈ R are scalars such that

α(a + b) + β(a − b) = 0.

We must show that α = β = 0. Now we can rewrite the
above equation as

(α + β)a + (α − β)b = 0;

since {a, b} is a basis it is linearly independent, and so

α + β = α − β = 0.

The only solution of these equations is α = β = 0.

Next we must show that given v we can find scalars λ, μ ∈ R
such that

v = λ(a + b) + μ(a − b)
  = (λ + μ)a + (λ − μ)b.

We can certainly find α, β such that

v = αa + βb

since {a, b} is a basis and so spans the set of geometric vectors.
We shall therefore solve our problem if we can find λ, μ ∈ R
such that

λ + μ = α
λ − μ = β.

Clearly

λ = ½(α + β), μ = ½(α − β)

will do the job required.


0.2.6 Summary of Section 0.2 


In this section we have defined the terms 


geometric vector (page 11) 
resultant (page 13) 
addition (of geometric vectors) (page 13) 
scalar multiplication (of geometric vectors) (page 16) 
scalar (page 16) 
linear combination (page 18) 
span (page 18) 
linearly dependent (page 20) 
linearly independent (page 20) 
basis (for geometric vectors in the plane) (page 21) 


We introduced the notation 


AB (page 11) 
a (page 13) 
0 (page 15)
=i (page 15) 
a-b (page 15) 
=> (page 15) 

Techniques


1. Addition of geometric vectors. 
2. Scalar multiplication of geometric vectors. 


3. Determine whether a given set of geometric vectors spans or is linearly 
independent. 


LM 0.3.1 


0.3 VECTOR SPACES 
0.3.1 The Algebra of Lists 


How can we best make use of a basis? Let us begin by considering the basis
{i, j} consisting of the ‘unit’ geometric vectors in the directions of the x-axis
and y-axis respectively. If a and b are any two geometric vectors in the plane
then there are scalars (real numbers) α₁, α₂, β₁, β₂ such that

a = α₁i + α₂j
b = β₁i + β₂j.

Then

a + b = (α₁i + α₂j) + (β₁i + β₂j)
      = (α₁i + β₁i) + (α₂j + β₂j)    using properties A2 and A5
      = (α₁ + β₁)i + (α₂ + β₂)j      using property B4.

The remarkable thing to notice is that addition of the geometric vectors a
and b is accomplished by adding separately the scalars multiplying i and
the scalars multiplying j. That is to say, having chosen {i, j} as the basis in
terms of which our geometric vectors shall be expressed, addition of
geometric vectors is reduced to addition in R, twice. This feature is most
easily expressed in terms of the addition of 2 × 1 column matrices.


The matrix equation

\[ \begin{bmatrix} \alpha_1 \\ \alpha_2 \end{bmatrix} + \begin{bmatrix} \beta_1 \\ \beta_2 \end{bmatrix} = \begin{bmatrix} \alpha_1 + \beta_1 \\ \alpha_2 + \beta_2 \end{bmatrix} \]

contains all the information to determine a + b given a and b in terms of
{i, j}. Each geometric vector is represented by a column matrix or list
whose first entry is the coefficient of i and whose second entry is the
coefficient of j. It is no accident that addition of lists corresponds to
addition of geometric vectors; indeed it would not take you very long to
show that (matrix) addition of lists has essentially the properties A1 to A5 of
geometric vectors.


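In the same way scalar multiplication of a geometric vector gives us

λa = λ(α₁i + α₂j)
   = λ(α₁i) + λ(α₂j)      property B3
   = (λα₁)i + (λα₂)j      property B2,

which corresponds to the matrix equation

\[ \begin{bmatrix} \lambda & 0 \\ 0 & \lambda \end{bmatrix} \begin{bmatrix} \alpha_1 \\ \alpha_2 \end{bmatrix} = \begin{bmatrix} \lambda\alpha_1 \\ \lambda\alpha_2 \end{bmatrix}. \]

Here we have used the matrix corresponding to a dilation or scaling, which
you first met in the Foundation Course, to represent scalar multiplication
of geometric vectors, which corresponds to the scaling of a translation.

Since the scalings which concern us here are uniform (the same for each axis)
we can condense the matrix equation above to

\[ \lambda \begin{bmatrix} \alpha_1 \\ \alpha_2 \end{bmatrix} = \begin{bmatrix} \lambda\alpha_1 \\ \lambda\alpha_2 \end{bmatrix}. \]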
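The claim that a uniform scaling matrix and plain scalar multiplication of a list give the same result can be checked directly. This is a small sketch, assuming a list is stored as a Python list of two numbers and a 2 × 2 matrix as a list of two rows; the helper names are illustrative, not notation from the text.

```python
def mat_vec(m, v):
    """Multiply a 2 x 2 matrix (given as a list of rows) by a 2 x 1 list."""
    return [m[0][0] * v[0] + m[0][1] * v[1],
            m[1][0] * v[0] + m[1][1] * v[1]]

def scale(lam, v):
    """Multiply every entry of the list v by the scalar lam."""
    return [lam * entry for entry in v]

lam = 3
v = [2, -7]
scaling_matrix = [[lam, 0], [0, lam]]   # uniform scaling: the same factor on each axis

print(mat_vec(scaling_matrix, v))   # [6, -21]
print(scale(lam, v))                # [6, -21]
```

The two printed lists agree, which is why the matrix equation can be condensed to a scalar multiple of a list.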


The rules we now have for addition and scalar multiplication of lists look
deceptively like the corresponding rules defined for R² in sub-section 0.1.1.
We exploited this correspondence in the Foundation Course, where we
cheerfully rewrote the coordinates of a point in the plane, that is, an element
of R², as a matrix (or list). If we want to express this representation formally
we should say that the two number-pairs (1, 0) and (0, 1) have the requisite
properties to act as a basis for R²; every member (λ, μ) ∈ R² can be expressed
uniquely as

(λ, μ) = λ(1, 0) + μ(0, 1).

The basis {(1, 0), (0, 1)} now determines the list of coefficients \(\begin{bmatrix} \lambda \\ \mu \end{bmatrix}\) to
represent the pair (λ, μ). The full power of the matrix representation will not
become apparent until we find it necessary to use a different set of pairs as a
basis for R², in the same way as we found alternative bases for the set of
geometric vectors in the plane.

(Margin notes: In the context of matrices, “2 × 1” is read as “two by one”; it
refers to a matrix with two rows and one column. Each real number in a matrix
is called an element or entry of the matrix. Check the matrix multiplication!
See M101 Block IV Unit 3, Sections 3.4 and 3.5.)


We can extend our discussion of geometric vectors by considering 
translations in three-dimensional space. In this case no two geometric 
vectors can be found which span the whole set; we need three to do the job. 
Given a frame of Cartesian coordinate axes for space, we can choose 
{i,j,k} to be ‘unit’ geometric vectors in the direction of the x, y and z axes. 
Then an arbitrarily chosen geometric vector a can be expressed as a linear
combination

a = α₁i + α₂j + α₃k

for some α₁, α₂, α₃ ∈ R and, likewise, for any b we can find scalars β₁, β₂,
β₃ ∈ R such that

b = β₁i + β₂j + β₃k.

Addition and scalar multiplication of geometric vectors in three dimensions
satisfy all the properties listed in Section 0.2 and so we can show that

a + b = (α₁ + β₁)i + (α₂ + β₂)j + (α₃ + β₃)k
and
λa = (λα₁)i + (λα₂)j + (λα₃)k.


The details are similar to the two-dimensional case, and we shall not give 
them here. 


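This leads us to define an algebra of lists with three entries (3 × 1 matrices)
by

\[ \begin{bmatrix} \alpha_1 \\ \alpha_2 \\ \alpha_3 \end{bmatrix} + \begin{bmatrix} \beta_1 \\ \beta_2 \\ \beta_3 \end{bmatrix} = \begin{bmatrix} \alpha_1 + \beta_1 \\ \alpha_2 + \beta_2 \\ \alpha_3 + \beta_3 \end{bmatrix} \tag{1} \]

and

\[ \lambda \begin{bmatrix} \alpha_1 \\ \alpha_2 \\ \alpha_3 \end{bmatrix} = \begin{bmatrix} \lambda\alpha_1 \\ \lambda\alpha_2 \\ \lambda\alpha_3 \end{bmatrix}. \tag{2} \]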


These lists give a neat way of specifying geometric vectors; but do they 
only give us an alternative notation, or do they suggest anything new? Let's 
forget for a moment the origins of the lists. Equations (1) and (2) define ways 
of manipulating lists of numbers. There is no reason why we should always 
have only two or three elements in the list. Equations (1) and (2) can be 
extended to lists with more than three elements; for example, we can write 


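\[ \begin{bmatrix} a_1 \\ a_2 \\ a_3 \\ \vdots \\ a_n \end{bmatrix} + \begin{bmatrix} b_1 \\ b_2 \\ b_3 \\ \vdots \\ b_n \end{bmatrix} = \begin{bmatrix} a_1 + b_1 \\ a_2 + b_2 \\ a_3 + b_3 \\ \vdots \\ a_n + b_n \end{bmatrix} \]

and

\[ \lambda \begin{bmatrix} a_1 \\ a_2 \\ a_3 \\ \vdots \\ a_n \end{bmatrix} = \begin{bmatrix} \lambda a_1 \\ \lambda a_2 \\ \lambda a_3 \\ \vdots \\ \lambda a_n \end{bmatrix}. \]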
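The two rules for lists with n entries translate almost word for word into code. The sketch below assumes a list is stored as a Python list of numbers; the names `add_lists` and `scale_list` are illustrative choices, not notation from the text.

```python
def add_lists(a, b):
    """Add two lists entry by entry; both must have the same number of entries."""
    if len(a) != len(b):
        raise ValueError("lists must have the same number of entries")
    return [x + y for x, y in zip(a, b)]

def scale_list(lam, a):
    """Multiply every entry of the list a by the scalar lam."""
    return [lam * x for x in a]

a = [1, 2, 3, 4, 5]
b = [10, 20, 30, 40, 50]
print(add_lists(a, b))    # [11, 22, 33, 44, 55]
print(scale_list(2, a))   # [2, 4, 6, 8, 10]
```

Nothing in either function depends on n being 2 or 3, which is the point made in the text: the rules extend to lists of any length.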


But this is rather futile if we have a physical or mathematical interpretation 
only when the lists contain three elements or fewer. However, we can use 
these lists to describe situations other than the algebra of geometric vectors, 
and we can interpret results and concepts in one situation (for example, 
basis and linear independence) to give results and concepts in another. That 
being so, we shall go on to discuss the abstract structure which typifies all 
the exemplary situations. 


What else can we represent by lists? 


Example 1 Polynomial Functions 

Consider the set of all polynomial functions of the form

p: x ↦ ax³ + bx² + cx + d    (x ∈ R)

where a, b, c and d are real numbers.

We can represent p by the four coefficients a, b, c and d, which we can
arrange as a list:

\[ \begin{bmatrix} a \\ b \\ c \\ d \end{bmatrix} \]

The addition of two such polynomial functions corresponds to the addition
of the corresponding two lists. Thus if

p₁: x ↦ a₁x³ + b₁x² + c₁x + d₁    (x ∈ R)
and
p₂: x ↦ a₂x³ + b₂x² + c₂x + d₂    (x ∈ R)

then we define addition of functions by

p₁ + p₂: x ↦ p₁(x) + p₂(x)    (x ∈ R)

and the scalar multiplication of functions by

λp: x ↦ λp(x)    (x ∈ R).

It does not take much effort to see that p₁ + p₂ corresponds to the list

\[ \begin{bmatrix} a_1 \\ b_1 \\ c_1 \\ d_1 \end{bmatrix} + \begin{bmatrix} a_2 \\ b_2 \\ c_2 \\ d_2 \end{bmatrix} = \begin{bmatrix} a_1 + a_2 \\ b_1 + b_2 \\ c_1 + c_2 \\ d_1 + d_2 \end{bmatrix} \]

and the function λp corresponds to the list

\[ \lambda \begin{bmatrix} a \\ b \\ c \\ d \end{bmatrix} = \begin{bmatrix} \lambda a \\ \lambda b \\ \lambda c \\ \lambda d \end{bmatrix}. \]


By considering polynomials of degree higher than three we would get
examples of lists with more than four elements. It is worth noting in this
example that although the definitions of addition and scalar multiplication
of functions are ‘obvious’, we are considering functions as the members of a
set on which operations can be defined.

(Margin note: Remember that a, b, c and d can be any real numbers, and so a
function such as f: x ↦ 0x³ + 0x² + 1 is included in this set of functions.)
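The correspondence between adding polynomial functions and adding their coefficient lists can be spot-checked numerically. This is a minimal sketch, assuming a cubic is stored as the list [a, b, c, d] of its coefficients; the sample polynomials and helper names are made up for illustration.

```python
def evaluate(coeffs, x):
    """Evaluate a*x**3 + b*x**2 + c*x + d for coeffs = [a, b, c, d]."""
    a, b, c, d = coeffs
    return a * x**3 + b * x**2 + c * x + d

def add_lists(p, q):
    """Entry-by-entry sum of two coefficient lists."""
    return [s + t for s, t in zip(p, q)]

def scale_list(lam, p):
    """Multiply every coefficient by the scalar lam."""
    return [lam * s for s in p]

p1 = [1, 0, -2, 5]    # x**3 - 2x + 5
p2 = [0, 3, 4, -1]    # 3x**2 + 4x - 1
lam = 2

for x in (-2, 0, 1, 3):
    # The function p1 + p2 agrees with the polynomial of the summed list.
    assert evaluate(add_lists(p1, p2), x) == evaluate(p1, x) + evaluate(p2, x)
    # The function lam * p1 agrees with the polynomial of the scaled list.
    assert evaluate(scale_list(lam, p1), x) == lam * evaluate(p1, x)

print(add_lists(p1, p2))   # [1, 3, 2, 4]
```

Checking a handful of values of x is an illustration, not a proof; the general statement follows by collecting like powers of x.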


Example 2 Solutions of Differential Equations 


Often in applied mathematics we are faced with the problem of finding a
function, f say, which is related to a given function g through its derivatives.
For example, we may need to determine the set of functions

{f : f''(x) − 3f'(x) + 2f(x) = g(x) for all x ∈ R}.

We call an equation such as the one above a differential equation, and each f
in the stated set satisfies the equation or is a solution of the equation.

Suppose, for example, that g were the zero function, defined by

g: x ↦ 0    for x ∈ R.

Then the function

f₁: x ↦ e^x    for x ∈ R

is one function which satisfies the equation.
In fact,

f₁'(x) = e^x and f₁''(x) = e^x

so that

f₁''(x) − 3f₁'(x) + 2f₁(x) = e^x − 3e^x + 2e^x = 0.

Another function which satisfies the equation is

f₂: x ↦ e^(2x)    for x ∈ R.

We shall show later in this course that any solution of this differential
equation has the form

αf₁ + βf₂

where α and β are real numbers, and f₁ and f₂ are the functions given above.
If we take f₁ and f₂ as basic solutions, then any solution of the form αf₁ + βf₂
can be represented by the list \(\begin{bmatrix} \alpha \\ \beta \end{bmatrix}\). The particular solutions f₁ and f₂ can be
represented by \(\begin{bmatrix} 1 \\ 0 \end{bmatrix}\) and \(\begin{bmatrix} 0 \\ 1 \end{bmatrix}\) respectively, and in general the list \(\begin{bmatrix} \alpha \\ \beta \end{bmatrix}\)
represents the function

x ↦ αe^x + βe^(2x)    for x ∈ R.


You may like to verify that the lists 


E[l- 
(al[s | 


also represent solutions of the equation. 


and 


Exercise 
Show that the function

x ↦ αe^x + βe^(2x)    for x ∈ R

satisfies the differential equation

f''(x) − 3f'(x) + 2f(x) = 0.


(Margin note: Differential equations were introduced in M101 Block V Unit 2.)


Solution

If f(x) = αe^x + βe^(2x)
then f'(x) = αe^x + 2βe^(2x)
and f''(x) = αe^x + 4βe^(2x).
So f''(x) − 3f'(x) + 2f(x) = αe^x + 4βe^(2x) − 3αe^x − 6βe^(2x)
                             + 2αe^x + 2βe^(2x)
                           = 0 for all x ∈ R.
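The cancellation in this solution can also be observed numerically. The sketch below assumes the derivative formulas just derived and evaluates the left-hand side of the differential equation at a few sample points; because it uses floating-point arithmetic, the result is only zero up to rounding error. The function name `residual` is an illustrative choice.

```python
import math

def residual(alpha, beta, x):
    """Return f''(x) - 3 f'(x) + 2 f(x) for f(x) = alpha*e^x + beta*e^(2x)."""
    f = alpha * math.exp(x) + beta * math.exp(2 * x)
    f_prime = alpha * math.exp(x) + 2 * beta * math.exp(2 * x)
    f_double_prime = alpha * math.exp(x) + 4 * beta * math.exp(2 * x)
    return f_double_prime - 3 * f_prime + 2 * f

for x in (-1.0, 0.0, 0.5, 1.0):
    # Each value is zero up to floating-point rounding.
    print(x, residual(1.5, -2.0, x))
```

The choice α = 1.5, β = −2 is arbitrary; any pair of real numbers gives a solution, which is why the list of the two coefficients is enough to label it.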


0.3.2 Vector Spaces 


From what we have learned about geometric vectors, we are now able to
construct an abstract mathematical structure called a vector space.


A feature common to the geometric vectors and the examples in the last 
sub-section is that, in each case, we had a set on which we could sensibly 
define addition and multiplication by a scalar. 


We shall take the structure which we have developed on the set of geometric 
vectors as our model, and discuss an arbitrary set with operations called 
addition and multiplication by a scalar defined on it. If the structure satisfies 
the following axioms, then we call it a vector space, and we call its elements 
vectors. 


The set of geometric vectors is a particular example of a vector space, and it
is the origin of the subject in geometry which motivates this use of the word
space.


One of the purposes of talking about structures such as vector spaces in the 
abstract is that we hope to be able to represent a number of apparently 
different structures in the same terms. Thus when we refer to a vector in a 
vector space, it may be a number pair, a geometric vector, a solution to a 
differential equation, a list, a polynomial function, or one of many other 
things. By proving theorems about vector spaces in general, we are able to 
obtain results for all these different situations at once. This ‘increase in 
productivity’ is a prime reason for generalization in mathematics. 


We choose a notation which is not too suggestive of any one particular 
example, and use boldface letters such as v, a, i to represent vectors. 


In order that a set V should be called a vector space, we require that the
operations of addition of members of V and scalar multiplication of members
of V should be defined and have the following properties (modelled upon
those of geometric vectors given in Section 0.2):


Axioms of a Vector Space 
For any elements v, v₁, v₂, v₃ of V and any real numbers α, β:

A1  v₁ + v₂ is a unique element of V (V is closed for addition)
A2  v₁ + (v₂ + v₃) = (v₁ + v₂) + v₃ (addition is associative)
A3  There is an element in V, which we call v₀, such that
    v + v₀ = v
A4  For each v there is an element −v such that v + (−v) = v₀
A5  v₁ + v₂ = v₂ + v₁ (addition is commutative)

B1  αv is an element of V
B2  (αβ)v = α(βv)
B3  α(v₁ + v₂) = (αv₁) + (αv₂)
B4  (α + β)v = αv + βv
B5  1 × v = v
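For lists of numbers the axioms can be spot-checked by machine. The following is a rough sketch, assuming vectors are Python lists of integers with entry-by-entry addition and scalar multiplication, taking the all-zero list as v₀ and (−1)v as −v. Passing such a check on sample inputs illustrates the axioms but does not prove them; the closure axioms A1 and B1 hold here simply because both operations return a list of the same length.

```python
import random

def add(u, v):
    """Entry-by-entry addition of two lists of equal length."""
    return [x + y for x, y in zip(u, v)]

def scale(alpha, v):
    """Multiply every entry of the list v by the scalar alpha."""
    return [alpha * x for x in v]

def check_axioms(v, v1, v2, v3, alpha, beta):
    """Record whether each equational axiom holds for these particular inputs."""
    zero = [0] * len(v)     # candidate zero vector (assumption: all entries zero)
    neg_v = scale(-1, v)    # candidate for -v (assumption: (-1) times v)
    return {
        "A2": add(v1, add(v2, v3)) == add(add(v1, v2), v3),
        "A3": add(v, zero) == v,
        "A4": add(v, neg_v) == zero,
        "A5": add(v1, v2) == add(v2, v1),
        "B2": scale(alpha * beta, v) == scale(alpha, scale(beta, v)),
        "B3": scale(alpha, add(v1, v2)) == add(scale(alpha, v1), scale(alpha, v2)),
        "B4": scale(alpha + beta, v) == add(scale(alpha, v), scale(beta, v)),
        "B5": scale(1, v) == v,
    }

rng = random.Random(0)  # fixed seed so that the run is repeatable

def random_list(n=4):
    """A list of n small random integers."""
    return [rng.randint(-9, 9) for _ in range(n)]

results = check_axioms(random_list(), random_list(), random_list(), random_list(),
                       rng.randint(-9, 9), rng.randint(-9, 9))
print(results)
```

Integers are used rather than floating-point numbers so that the equality tests are exact.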




LM 0.3.2 


These ten axioms are the axioms of a vector space. There are two important
points to note. Strictly speaking, we should call V a vector space over the 
real numbers or a real vector space, because vector spaces exist involving 
sets of scalars other than the set of real numbers; we shall discuss only 
vector spaces over the real numbers. Secondly, we have taken as implicit all 
the relevant properties of the real numbers, and these should really be stated 
along with the other axioms. Any other set with these properties can be 
taken as the set of scalars in place of the set of real numbers to give a 
different vector space. 


The axioms of a vector space therefore consist of three sets of axioms: 


(i) those applying to the set of vectors only (A1 to A5 above);

(ii) those applying to the set of scalars only (not stated above: the missing
axioms are the axioms of what is known in mathematics as a field);

(iii) those which describe the interaction between the set of scalars and
the set of vectors (B1 to B5 above).


We define an operation of subtraction of vectors by

v₁ − v₂ = v₁ + (−v₂).

From axiom A4, it follows that v − v = v₀.


The Zero Vector 


In a vector space, an element v₀ which satisfies axiom A3 is called a zero
vector. It follows from the axioms that in any vector space V there is only
one zero vector. For suppose there are two vectors v₀ and v₀′ which satisfy
axiom A3. That is,

v + v₀ = v
v + v₀′ = v,

where, in each equation, v is any element of V. Let us put v = v₀′ in the first
equation and v = v₀ in the second equation. We obtain

v₀′ + v₀ = v₀′
v₀ + v₀′ = v₀.

By axiom A5,

v₀′ + v₀ = v₀ + v₀′,
i.e. v₀′ = v₀,

so the zero vector is unique.


Since the zero vector in a vector space behaves just like the zero geometric 
vector, we shall call this element the zero vector and denote it by 0, just as we 
had 0 for the zero geometric vector. (In terms of lists, 0 is the list in which 
every entry is zero.) 


Further properties of 0 can be deduced from the axioms. For example, it can 
be shown that 


α0 = 0,


where « is any real number. 


Exercises 


1. (i) Which of the examples of sub-section 0.3.1 describe vector
spaces?

(ii) The set of all polynomial functions of degree n with the
operations of addition of functions and multiplication of a
function by a real number is not a vector space. Why not?
Suggest a suitable modification to make it a vector space.

HINT: A (real) polynomial function of degree n is a function of the
form

x ↦ aₙx^n + aₙ₋₁x^(n−1) + ··· + a₁x + a₀

for x ∈ R in which the aᵢ are real numbers (i = 0, 1, ..., n) and
aₙ ≠ 0.


2. In each of the following cases state whether the given set of lists forms a
vector space for the operations of addition of lists and multiplication of
a list by a scalar. In each case give reasons for your answer.

(i) The set of all lists \(\begin{bmatrix} x_1 \\ x_2 \end{bmatrix}\), where x₁, x₂ are positive real numbers.

(ii) The set of all lists \(\begin{bmatrix} x_1 \\ x_2 \end{bmatrix}\), where x₁, x₂ are real numbers and
x₁ + x₂ = 0.

(iii) The set of all lists \(\begin{bmatrix} x_1 \\ x_2 \end{bmatrix}\), where x₁ and x₂ are real numbers and
x₁ < x₂.

(iv) The set of all lists \(\begin{bmatrix} x_1 \\ x_2 \\ x_3 \end{bmatrix}\), where x₁, x₂ and x₃ are real numbers
such that the function

f: t ↦ x₁t² + x₂t + x₃    for t ∈ R

satisfies f(k) = 0, where k is a fixed real number.

3. If V is a vector space with zero vector 0, show that {0} is also a vector
space.


Solutions 


1. (i) Both examples describe vector spaces.
(Example 1 is discussed by implication below.)

(ii) The problem is caused when we add, say, the polynomial
function x ↦ −x^n to the polynomial function
x ↦ x^n + x^(n−1). Both are of degree n, but their sum is the
polynomial x ↦ x^(n−1), which is of degree n − 1, so + is
not closed, i.e. axiom A1 is violated. A simple modification
is to consider the set of polynomials of degree less than or
equal to n. With the suggested operations, this set is indeed
a vector space.

2. (i) No. For example, multiplication by a negative scalar
takes us out of the set, i.e. axiom B1 is violated.

(ii) Yes. All the axioms are satisfied. (In fact, all the points in
R² corresponding to the vectors lie on the line defined by
the equation y + x = 0.)

(iii) No. For example, if x₁ < x₂ and α < 0, then αx₁ > αx₂;
i.e. \(\alpha \begin{bmatrix} x_1 \\ x_2 \end{bmatrix}\) does not belong to the given set, so axiom B1 is
violated.

(iv) Yes. All the axioms of a vector space are satisfied. (Each
function has a graph which passes through the fixed point
(k, 0).)


3. We check that the axioms of a vector space are satisfied.

A1  0 + 0 = 0, since 0 is the zero element of V, so {0} is closed
    for addition.

A3  0 ∈ {0}.

A4  −0 = 0, since 0 + 0 = 0.

B1  α0 = 0 ∈ {0} (by a result quoted in the text).

The other axioms are automatically satisfied, since they are
satisfied for all elements of V, and 0 ∈ V.

Hence {0} is a (real) vector space.
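The closure failures and successes in Solution 2 can be seen on concrete lists. The sketch below assumes lists of two integers and encodes each set by a membership test; the function names are illustrative. A single counter-example is enough to show that a set is not closed, whereas the `True` results for set (ii) are only instances of a fact that needs the general argument.

```python
def add(u, v):
    """Entry-by-entry addition of two lists."""
    return [x + y for x, y in zip(u, v)]

def scale(alpha, v):
    """Multiply every entry of the list v by the scalar alpha."""
    return [alpha * x for x in v]

def in_set_i(v):
    """Set (i): both entries positive."""
    return v[0] > 0 and v[1] > 0

def in_set_ii(v):
    """Set (ii): entries summing to zero."""
    return v[0] + v[1] == 0

def in_set_iii(v):
    """Set (iii): first entry less than the second."""
    return v[0] < v[1]

v = [1, 2]
print(in_set_i(v), in_set_i(scale(-1, v)))       # True False: B1 fails for set (i)
print(in_set_iii(v), in_set_iii(scale(-1, v)))   # True False: B1 fails for set (iii)

u, w = [3, -3], [-5, 5]
print(in_set_ii(add(u, w)), in_set_ii(scale(4, u)))   # True True
```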

Where next?


In the case of geometric vectors, we introduced the idea of a basis. The 
development of this idea depended on the concepts of linear combination of 
vectors and linear dependence. We can extend these ideas to the more 
general concept of a vector space. 


We also made passing reference to these ideas in our examples in sub-section
0.3.1. In the differential equation example, we shall see that every
solution of

f''(x) − 3f'(x) + 2f(x) = 0

can be represented in terms of two basic solutions, for example

f₁: x ↦ e^x       for x ∈ R
f₂: x ↦ e^(2x)    for x ∈ R.


How do we choose a basis for a vector space? How many vectors do we 
need? If we can settle the question of how many vectors we need—can we 
select that number of vectors at random? 


Before we extend our idea of a basis to an abstract vector space, we shall 
define linear dependence in this context. 

Linear Dependence and Independence

The following definitions generalize the notion of linear dependence which
we introduced for geometric vectors.

If v₁, v₂, v₃, ..., vₙ are vectors from a vector space, then an expression of the
form

α₁v₁ + α₂v₂ + α₃v₃ + ··· + αₙvₙ,

where the αᵢ are real numbers, is called a linear combination of vectors.

The set of vectors {v₁, v₂, ..., vₙ} is said to be linearly dependent if and only if
there exist real numbers α₁, α₂, ..., αₙ, which are not all zero, such that

α₁v₁ + α₂v₂ + α₃v₃ + ··· + αₙvₙ = 0;

in other words, if 0 is a non-trivial linear combination of the vectors
v₁, v₂, ..., vₙ.

A set of vectors which is not linearly dependent is said to be linearly
independent. We can define this term in a more positive way as follows.

A set of vectors {v₁, v₂, v₃, ..., vₙ} is linearly independent if and only if the
equation

α₁v₁ + α₂v₂ + α₃v₃ + ··· + αₙvₙ = 0

has just one solution, namely

α₁ = α₂ = α₃ = ··· = αₙ = 0.


Remember that we use the terms dependent and independent in this way
because we can express some members of a linearly dependent set in terms
of the others. For example, if α₁ is not zero, we can use the axioms of a vector
space to write

α₁v₁ + α₂v₂ + α₃v₃ + ··· + αₙvₙ = 0

in the form

α₁v₁ = (−α₂)v₂ + (−α₃)v₃ + ··· + (−αₙ)vₙ,

and then divide by α₁ to give

v₁ = −(α₂/α₁)v₂ − (α₃/α₁)v₃ − ··· − (αₙ/α₁)vₙ,

i.e. v₁ depends on (i.e. is a linear combination of) the other vectors. In
general, if a set of vectors is linearly dependent, some of the vectors in the set
(not necessarily every vector, because some of the α's may be zero) can be
expressed in terms of the others. In other words, some of the elements in the
set are redundant.
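For lists of numbers, linear independence can be tested by machine. The sketch below uses row reduction (Gaussian elimination), a technique which the text has not introduced at this point and which is assumed here without proof: the number of non-zero rows left after reduction equals the number of vectors exactly when the only solution of the defining equation is the trivial one. Exact fractions are used to avoid rounding, and the sample sets are made up for illustration.

```python
from fractions import Fraction

def rank(vectors):
    """Number of non-zero rows left after row-reducing the given lists."""
    rows = [[Fraction(x) for x in v] for v in vectors]
    n_cols = len(rows[0]) if rows else 0
    r = 0  # number of pivot rows found so far
    for col in range(n_cols):
        # Look for a row at or below position r with a non-zero entry in this column.
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        # Clear this column in every other row.
        for i in range(len(rows)):
            if i != r and rows[i][col] != 0:
                factor = rows[i][col] / rows[r][col]
                rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
        if r == len(rows):
            break
    return r

def is_linearly_independent(vectors):
    """True when no non-trivial linear combination of the lists is the zero list."""
    return rank(vectors) == len(vectors)

print(is_linearly_independent([(1, 0, 0), (1, 1, 0), (1, 1, 1)]))   # True
print(is_linearly_independent([(1, 2, 3), (4, 5, 6), (7, 8, 9)]))   # False
```

In the second set (7, 8, 9) = 2(4, 5, 6) − (1, 2, 3), so one vector is redundant in exactly the sense described above.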

Exercises

4. In each of the following parts a set of vectors is given. In each case state
whether or not the set is linearly independent.

In those cases where the set is linearly dependent, express one of the
vectors in the set as a linear combination of the others.

(i) The vector space R³ is defined as the set of ordered triples of real
numbers, i.e. {(x, y, z) : x, y, z ∈ R}, with componentwise addition
and scalar multiplication,

(x, y, z) + (u, v, w) = (x + u, y + v, z + w)
α(x, y, z) = (αx, αy, αz).

(a) {(1, −1, 0), (0, 1, 0), (1, 0, 0)}
(b) {(2, 0, 0), (0, 3, 0), (0, 0, 5)}

(ii) The set of functions {f, g}, where

f: x ↦ x     (x ∈ R)
g: x ↦ x²    (x ∈ R),

with the operations of addition of functions and multiplication
of a function by a real number.

5. If the set of vectors {v₁, v₂, v₃, ..., vₙ} is linearly independent, show that
if

α₁v₁ + α₂v₂ + ··· + αₙvₙ = β₁v₁ + β₂v₂ + ··· + βₙvₙ

then

α₁ = β₁, α₂ = β₂, ..., αₙ = βₙ.

6. If {v₁, v₂, ..., vₙ} is a linearly independent set of vectors, prove that any
subset of this set is also linearly independent.

7. If {v₁, v₂, ..., vₙ} is a linearly dependent subset of a vector space V,
prove that {v₁, v₂, ..., vₙ, w} is also linearly dependent, where w is any
element in V.


(Margin notes: The zero vector in R³ is the triple (0, 0, 0). In Exercise 4(ii)
the zero vector is the function 0: x ↦ 0 (x ∈ R). Notice that the result of
Exercise 5 implies that a vector v cannot be expressed in two different ways
as a linear combination of a set of linearly independent vectors.)


Solutions 


4. (i) (a) This set of triples is linearly dependent: for instance,

(1, 0, 0) = (1, −1, 0) + (0, 1, 0).

(b) This set of triples is linearly independent. If

α₁(2, 0, 0) + α₂(0, 3, 0) + α₃(0, 0, 5) = (0, 0, 0)

then (2α₁, 3α₂, 5α₃) = (0, 0, 0),

whence α₁ = α₂ = α₃ = 0.

(ii) The set of functions is linearly independent, because
αf + βg = 0 implies that

αx + βx² = 0 for all values of x,

and this is possible only if α = β = 0.

5. Using the axioms of a vector space, we can show that the given
equation is equivalent to

(α₁ − β₁)v₁ + (α₂ − β₂)v₂ + ··· + (αₙ − βₙ)vₙ = 0.

Since the set of vectors {v₁, v₂, ..., vₙ} is linearly independent,
the coefficients of the vectors in the above equation are all zero,
so

α₁ − β₁ = α₂ − β₂ = ··· = αₙ − βₙ = 0,

which proves the required result.

6. Suppose, in contradiction to what we want to prove, that the
subset {v₁, v₂, ..., vₖ} is linearly dependent: then there are
numbers α₁, α₂, ..., αₖ (not all zero) such that

α₁v₁ + α₂v₂ + ··· + αₖvₖ = 0.

Therefore

α₁v₁ + α₂v₂ + ··· + αₖvₖ + 0vₖ₊₁ + ··· + 0vₙ = 0.

But not all the α₁, ..., αₖ are zero, and hence the set of vectors
{v₁, v₂, ..., vₙ} is linearly dependent, which is a contradiction.

7. If {v₁, v₂, ..., vₙ} is linearly dependent, then there are numbers
α₁, α₂, ..., αₙ, not all zero, such that

α₁v₁ + α₂v₂ + ··· + αₙvₙ = 0.

Hence

α₁v₁ + α₂v₂ + ··· + αₙvₙ + 0w = 0.

Not all the coefficients in this last equation are zero, and so we
have proved the required result.


(Margin note to Solution 4(ii): Try x = 1 and x = −1.)

(Margin note to Solution 6: This popular method of proof is known as proof by
contradiction. We suppose that the stated result is false, and show that this
supposition leads to a contradiction in terms of the given hypothesis. We
thereby deduce that the stated result is true.)


LM 0.3.3 


0.3.3 Bases and Dimension 


In sub-section 0.2.4 we saw that it is possible to select two geometric vectors 
in a plane, and then to specify every geometric vector in the plane as a linear 
combination of those two. Similarly, in three dimensions we need to select 
three geometric vectors. We called such a set a basis, and we now wish to 
extend the same idea to an abstract vector space. 


The set of vectors {v₁, v₂, ..., vₘ} is said to span the vector space V if for each
element w in V we can find scalars α₁, α₂, ..., αₘ such that

w = α₁v₁ + α₂v₂ + α₃v₃ + ··· + αₘvₘ.

If the set of vectors {v₁, v₂, ..., vₙ} is linearly independent and spans the
vector space V, then we say that it forms a basis for V.


Essentially, a basis contains the minimum number of elements which are 
required to span the space. In Exercise 5 of sub-section 0.3.2 we saw that any 
vector can be expressed in a unique way as a linear combination of the 
elements of a basis. 


For example, the set {i,j,k} spans the three-dimensional geometric 
vector space, because each geometric vector r can be expressed in the form 


r=xi+yj+zk. 


Here i, j and k play the parts of v₁, v₂ and v₃, and we know that it is
possible to find the appropriate values x, y and z which play the parts of
α₁, α₂ and α₃. Any set of geometric vectors containing i, j and k and other
geometric vector(s) would also span the space, but it would not form a basis,
since such a set would be linearly dependent (the other geometric vector(s)
would be redundant).

Exercise

Show that the set {(1, 0, 0), (1, 1, 1), (0, 0, 1)} is a basis for the space R³ of all
triples of real numbers.

Solution

Any triple (x₁, x₂, x₃) ∈ R³ can be written as

(x₁, x₂, x₃) = (x₁ − x₂)(1, 0, 0) + x₂(1, 1, 1) + (x₃ − x₂)(0, 0, 1),

so the three given vectors span R³.

Also the set of triples is linearly independent, since

α₁(1, 0, 0) + α₂(1, 1, 1) + α₃(0, 0, 1) = (0, 0, 0)

implies (α₁ + α₂, α₂, α₂ + α₃) = (0, 0, 0),

i.e. α₁ + α₂ = 0
     α₂ = 0
     α₂ + α₃ = 0,

whence

α₁ = α₂ = α₃ = 0.

It follows that the given set of triples is a basis.
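The coefficients found in this solution can be checked by recombining them. The sketch below assumes triples are stored as Python tuples and uses the coefficient formula from the solution; the helper names are illustrative only.

```python
BASIS = [(1, 0, 0), (1, 1, 1), (0, 0, 1)]

def coordinates(x):
    """Coefficients of x with respect to BASIS, from the formula in the solution."""
    x1, x2, x3 = x
    return (x1 - x2, x2, x3 - x2)

def combine(coeffs, vectors):
    """Return the linear combination sum of coeffs[i] * vectors[i] as a tuple."""
    n = len(vectors[0])
    return tuple(sum(c * v[k] for c, v in zip(coeffs, vectors)) for k in range(n))

x = (4, -1, 7)
print(coordinates(x))                   # (5, -1, 8)
print(combine(coordinates(x), BASIS))   # (4, -1, 7)
```

With respect to the basis {(1, 0, 0), (0, 1, 0), (0, 0, 1)} the same triple has coefficients (4, −1, 7), so the list representing a vector depends on the basis chosen.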
As a result of this last exercise we have two distinct bases, namely
{(1, 0, 0), (0, 1, 0), (0, 0, 1)} and {(1, 0, 0), (1, 1, 1), (0, 0, 1)}, for the same vector
space R³, the space of all triples; and in this case both bases consist of three
vectors. In fact, although we shall not prove it here, this always happens: for
any two sets of basis vectors for the same vector space, there is always the
same number of vectors in each basis. This enables us to make the following
definition.

If {v₁, v₂, ..., vₙ} is a basis for a vector space V, then we say that the
vector space is of dimension n.


If it is impossible to find a finite number of elements of a vector space V 
which form a basis for V, and V # {0}, then we say that V has infinite 
dimension. 


It is in fact also true that any set of n linearly independent vectors in a vector 
space V of dimension n is a basis for V, but the proof of this result must be 
deferred until later. 


If we assume these results, then we can see that, since {(1, 0), (0, 1)} is a basis of
the vector space R² of ordered pairs of real numbers, this vector space
has dimension 2. The set {(1, 0, 0), (0, 1, 0), (0, 0, 1)} is a basis for R³,
the space of ordered triples, and this vector space is therefore of dimension
3. Let us look now at some non-geometric examples.


Example 1 
The set of all polynomial functions of degree 2 or less, i.e. of the form

f: x ↦ ax² + bx + c    (x ∈ R)

where a, b, c ∈ R, forms a vector space with the operations of addition of
functions and multiplication of a function by a real number.

We can find many sets of three vectors in this vector space which are linearly
independent. One such set, which is particularly simple, consists of the
vectors

f₁: x ↦ 1     (x ∈ R)
f₂: x ↦ x     (x ∈ R)
f₃: x ↦ x²    (x ∈ R).

Any quadratic function can be expressed in terms of these three, and hence
they form a basis for the vector space. The dimension of the space is
therefore 3. The function

f: x ↦ 3x² − 2x + 4    (x ∈ R)

can be written as a linear combination of the basis vectors:

f = 3f₃ − 2f₂ + 4f₁,

i.e.

(x ↦ 3x² − 2x + 4) = 3(x ↦ x²) − 2(x ↦ x) + 4(x ↦ 1).
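Treating functions themselves as the vectors can be mirrored in code by building new functions out of old ones. The sketch below assumes the three basis functions of this example and a helper, here called `linear_combination`, which is an illustrative name and not notation from the text.

```python
def f1(x):
    """Basis function x -> 1."""
    return 1

def f2(x):
    """Basis function x -> x."""
    return x

def f3(x):
    """Basis function x -> x**2."""
    return x ** 2

def linear_combination(coeffs, funcs):
    """Return the function x -> sum of coeffs[i] * funcs[i](x)."""
    def combined(x):
        return sum(c * g(x) for c, g in zip(coeffs, funcs))
    return combined

# f = 4 f1 - 2 f2 + 3 f3, i.e. the list of coefficients is [4, -2, 3].
f = linear_combination([4, -2, 3], [f1, f2, f3])

for x in (-2, 0, 1, 2):
    assert f(x) == 3 * x ** 2 - 2 * x + 4

print(f(2))   # 12
```

The list [4, −2, 3] plays the same role here as the coefficient lists of sub-section 0.3.1: once the basis is fixed, the list determines the function.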


(Margin note: The proof will be given in Unit 1.)

(Margin note: The f's are shown boldface in order to emphasize the fact that
we are considering the functions to be elements of a vector space.)


Example 2 

In sub-section 0.3.1, we stated that any solution of the equation

f''(x) − 3f'(x) + 2f(x) = 0

can be expressed in terms of the two solutions

f₁: x ↦ e^x       (x ∈ R),
f₂: x ↦ e^(2x)    (x ∈ R).

In other words, these two functions span the space of solutions. Since the
two solutions f₁ and f₂ are linearly independent, the set of all solutions of
the equation forms a vector space of dimension 2.

(Margin note: If you are worried by the statement that f₁ and f₂ are linearly
independent, you might like to try to prove it.)


0.3.4 Summary of Section 0.3 


In this section we defined the terms 


list (page 24) 
addition and scalar multiplication of functions (page 26) 
solution of a differential equation (page 27) 
zero function (page 27) 
vector space (page 28) 
zero vector (page 29) 
linear combination (page 31) 
linear dependence and independence (page 31) 
basis (page 34) 
dimension (page 35) 


We introduced the notation 


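\(\lambda \begin{bmatrix} a_1 \\ \vdots \\ a_n \end{bmatrix}\) for the list \(\begin{bmatrix} \lambda a_1 \\ \vdots \\ \lambda a_n \end{bmatrix}\) (pages 24, 25, 26)
v₀ (page 28)
0 (page 29)
R³, the space of ordered triples of real numbers (page 32)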


We have seen how sets of ordered pairs (or triples) and suitable sets of 
functions may be endowed with the operations of addition and scalar 
multiplication to yield an algebraic structure with properties analogous to 
those of addition and scalar multiplication of geometric vectors. 


A structure which possesses the selected properties is called a (real) vector 
space. The concept of a vector space is of fundamental importance; indeed 
the whole course is built round it. In Unit 1 we shall put the subject on firm
foundations by proving the unproved assertions of this section. 


Meanwhile, we shall continue this unit by investigating transformations
(functions) between vector spaces which preserve the fundamental linear
structure.


LM 0.4.0/0.4.1 


0.4 MAPPINGS OF VECTOR SPACES 


0.4.0 Introduction 


We have seen that vector spaces are interesting mathematical structures 
which model many different situations. But of itself this does not give us a 
way of solving problems. 


The concept of a function is often a way of introducing greater
sophistication into a structure. We shall find that vector spaces become
richer and more interesting when we introduce functions from one vector
space to another. Such functions are often referred to as mappings and it is
then usual to say that ‘a maps to b’ when a ↦ b.


0.4.1 Mapping One Vector Space to Another 


Example 1 

Consider the mapping of R² to R² defined by

(x, y) ↦ (−y, x).

Let us have a look at what happens to the vectors (1, 0) and (0, 1). We have

(1, 0) ↦ (0, 1) and (0, 1) ↦ (−1, 0).

[Figure: the vectors (1, 0) and (0, 1) plotted on x, y axes, before and after the mapping.]

The mapping has the effect of rotating these vectors through an angle π/2
anti-clockwise about the origin, and this is indeed the effect on the entire
plane.


In this example, any circle centred at the origin maps onto itself. Every point
of the set {(x, y): x² + y² = 1} moves but the set itself remains unchanged.

(Margin note: For example, (3/5, 4/5) ↦ (−4/5, 3/5).)
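The behaviour of this mapping is easy to explore on sample points. The sketch below assumes pairs are stored as Python tuples of integers; the function names and the sample points are illustrative only. It checks that the mapping sends the basis vectors where the text says, that x² + y² is unchanged (so circles centred at the origin map onto themselves), and that the mapping respects addition and scalar multiplication for the chosen samples.

```python
def rotate(v):
    """The mapping (x, y) -> (-y, x) of Example 1."""
    x, y = v
    return (-y, x)

def add(u, v):
    """Componentwise addition of two pairs."""
    return (u[0] + v[0], u[1] + v[1])

def scale(alpha, v):
    """Scalar multiple of a pair."""
    return (alpha * v[0], alpha * v[1])

def squared_length(v):
    """x**2 + y**2 for the pair v."""
    return v[0] ** 2 + v[1] ** 2

print(rotate((1, 0)), rotate((0, 1)))   # (0, 1) (-1, 0)

p = (3, 4)
q = rotate(p)
print(q, squared_length(p), squared_length(q))   # (-4, 3) 25 25

u, v = (2, -1), (5, 7)
assert rotate(add(u, v)) == add(rotate(u), rotate(v))   # respects addition
assert rotate(scale(3, u)) == scale(3, rotate(u))       # respects scalar multiples
```

Applying the mapping four times returns every point to where it started, as expected for a quarter turn.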


Now we shall look at a particularly significant example. 


Example 2 
Consider the mapping of $R^2$ to $R^2$ defined by
$(x, y) \mapsto (-y, y)$.


To see the effect of this mapping, it is worth looking at what happens to the 
vectors in a basis. If we choose the simple basis {(1, 0), (0, 1)}, we have


$(1, 0) \mapsto (0, 0)$
$(0, 1) \mapsto (-1, 1)$.




It is helpful to express everything in terms of our chosen basis, so let us write
i = (1, 0), j = (0, 1).

In terms of the vectors i and j, we have that j maps to $-i + j$, but i maps to
the zero vector.


We shall look at this mapping in a little more detail. What happens to the x- 
axis? On this axis y is zero and so every point on the x-axis maps to (0, 0): 
the entire x-axis shrinks into the origin. What about the line y = 1? Every 
point on this line maps to the point $(-1, 1)$.


In fact, the entire plane maps on to the line whose equation is 
x+y=0. 


In terms of the basis vectors, every element in the image set is a scalar 
multiple of the vector $-i + j$.


The image set has dimension 1, and so the effect of the mapping is to “lose” a 
dimension from our vector space. This is equivalent to saying that this 
mapping is not one-one. If we start with a point, P say, in the plane and map 
it to a point Q, on the line x + y = 0, then we cannot map back to the 
original point P. This is because the point Q on the line x + y = 0 (of the 
codomain) corresponds to the whole of the line parallel to the x-axis 
through P. 
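The two observations about Example 2 (every image lies on the line x + y = 0, and the mapping is not one-one) can be illustrated as follows. The grid of sample points and the names used are our own assumptions, chosen only for the sketch.

```python
def L(v):
    """The mapping (x, y) -> (-y, y) of Example 2."""
    x, y = v
    return (-y, y)

# A small grid of sample points in the plane (an arbitrary choice).
points = [(x, y) for x in range(-3, 4) for y in range(-3, 4)]
images = [L(p) for p in points]

# Every image (u, w) satisfies u + w = 0.
all_on_line = all(u + w == 0 for (u, w) in images)

# Points on the same line parallel to the x-axis share one image,
# so the mapping cannot be one-one.
same_image = L((0, 2)) == L((5, 2))
print(all_on_line, same_image, L((5, 2)))
```

A finite sample cannot prove the general statement, but it shows the pattern that the algebra in the text establishes.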


We have chosen these two examples, in which R? was identified with the 
Cartesian plane, to give you a visualization of the sort of mappings we are 
going to consider. One of the pleasant features of linear algebra is that by 
considering a geometric situation we can often throw light on non-geometric
situations (and vice versa). Thus, a non-geometric analogue of
the last case, where we “lost” a dimension, is provided by the following 
example. 


Example 3 


Let $P_4$ be the vector space of all polynomial functions of degree 3 or less. The
operation of differentiation can be thought of as a “mapping” with domain
$P_4$. Each polynomial function in $P_4$ is mapped to its derived function:
$D: p \mapsto p'$ $(p \in P_4)$
maps the space $P_4$ onto the space $P_3$ (which has dimension 3). In this case we
could take as a basis for $P_4$ the set of functions:
$f_0: x \mapsto 1$ $(x \in R)$
$f_1: x \mapsto x$ $(x \in R)$
$f_2: x \mapsto x^2$ $(x \in R)$
$f_3: x \mapsto x^3$ $(x \in R)$


This set of functions maps to the set
$f_0': x \mapsto 0$ $(x \in R)$
$f_1': x \mapsto 1$ $(x \in R)$
$f_2': x \mapsto 2x$ $(x \in R)$
$f_3': x \mapsto 3x^2$ $(x \in R)$

of which $f_1'$, $f_2'$ and $f_3'$ form a basis for $P_3$.


Note that the space $P_4$ has dimension 4; the four functions $f_0$, $f_1$, $f_2$, $f_3$ form a basis for it.


Like the previous example, this mapping is not one-one.


Note that $f_0'$ is the zero vector in $P_3$. It cannot belong to any basis for $P_3$,
since a basis must be a linearly independent set of three vectors. For
consider the set $\{f_0', g, h\}$, where $g, h \in P_3$. Since


$\alpha f_0' + 0g + 0h = f_0'$,
where $\alpha$ is any non-zero real number, we see that any set of (three) elements
containing $f_0'$ is linearly dependent.
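Example 3 can be made concrete by representing a polynomial of degree 3 or less by its list of coefficients. The representation and the function name below are assumptions made for this sketch only; the text itself works with functions, not lists.

```python
def derivative(coeffs):
    """Differentiate a0 + a1*x + a2*x^2 + a3*x^3, given as [a0, a1, a2, a3].

    The result [a1, 2*a2, 3*a3] represents a polynomial of degree 2 or less.
    """
    return [n * a for n, a in enumerate(coeffs)][1:]

# The basis f0, f1, f2, f3 of P4 as coefficient lists.
basis_P4 = {
    "f0": [1, 0, 0, 0],
    "f1": [0, 1, 0, 0],
    "f2": [0, 0, 1, 0],
    "f3": [0, 0, 0, 1],
}
images = {name: derivative(c) for name, c in basis_P4.items()}
print(images)

# Not one-one: x -> x + 7 and x -> x have the same derived function.
not_one_one = derivative([7, 1, 0, 0]) == derivative([0, 1, 0, 0])
```

The image of f0 is the list of zeros, that is, the zero vector of $P_3$, which is why it cannot form part of a basis.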
Exercise 


We seem to be putting a lot of faith in choosing a convenient basis. Is the 
choice of basis important? We shall resolve this difficulty later, but one
point can be considered here. 


In Example 3, instead of $f_0$ and $f_1$, we could choose $g_0$ and $g_1$, where
$g_0: x \mapsto 1 + x$ $(x \in R)$
$g_1: x \mapsto 1 - x$ $(x \in R)$.
Then $\{g_0, g_1, f_2, f_3\}$ is another basis of $P_4$ and none of the basis vectors maps
to the zero vector under $D$. Is $\{g_0', g_1', f_3'\}$ a basis for $P_3$?
Solution
$g_0': x \mapsto 1$ $(x \in R)$
$g_1': x \mapsto -1$ $(x \in R)$.


Although none of the basis vectors is mapped to the zero vector,
$\{g_0', g_1', f_3'\}$ is linearly dependent, since


$1g_0' + 1g_1' + 0f_3' = f_0'$.


$g_0'$ and $g_1'$ cannot both belong to the same basis because one is a
scalar multiple of the other.


0.4.2 Linear Transformations 


There are many interesting and useful results concerning mappings of 
vector spaces, but the most fruitful field of study consists of those mappings 
which are homomorphisms or isomorphisms. When we are considering a 
mathematical structure, it is often illuminating to study the functions which 
preserve that structure. A function which preserves structure is often called 
a morphism. 


Let us first take a brief look at the additive structure of a vector space V, that
is, the axioms A1 to A5. We observe that this structure is that of a
(commutative) group. It is not the purpose of this course to study groups;
but we shall be well served by noting the underlying principle of studying a
structure using a morphism. In this case, let us focus attention on a fixed
scalar $\lambda \in R$, and consider the function


$v \mapsto \lambda v$ for $v \in V$.


Axiom B4 guarantees that the image of $v_1 + v_2$ under this function is given
by


$v_1 + v_2 \mapsto \lambda v_1 + \lambda v_2$,


since the right-hand side is equal to $\lambda(v_1 + v_2)$. In other words, this function
is a homomorphism of the group structure of V, since the operation of
addition is preserved.


(Note on Example 3 of sub-section 0.4.1: the mapping $D$ is not one-one. For example, all the functions of the form
$f: x \mapsto x + a$ $(x \in R)$,
where $a \in R$, map to the function $f_1'$.)


M101 Block VI Unit 2.


Homomorphism: M101 Block VI Unit 4.


The full structure of the vector space V includes the operation of scalar 
multiplication; we shall, therefore, be concerned with functions on V which 
preserve both operations. Suppose a function T maps a vector space V, with
an addition operation $+_V$, to a vector space U, with an addition operation
$+_U$. The additive structure will be preserved if, for any vectors $v_1$ and $v_2$
in V,

$T(v_1 +_V v_2) = T(v_1) +_U T(v_2)$.


Since we have been abusing the symbol + through this unit (we have 
defined all sorts of methods of addition and called them all +), we shall 
continue to do so, and we drop the suffices U and V from the addition 
symbols. 


We then have
$T(v_1 + v_2) = T(v_1) + T(v_2)$ (1)


as the condition that T should be a morphism for the addition operations.
For the other operation we require that


$T(\alpha v) = \alpha T(v)$, (2)
for any real number $\alpha$ and any vector $v \in V$.


Equations (1) and (2) are the conditions that T should be a morphism from
the vector space V to the vector space U.


The two equations can be combined to give the following equation:
$T(\alpha_1 v_1 + \alpha_2 v_2) = \alpha_1 T(v_1) + \alpha_2 T(v_2)$, (3)
for any real numbers $\alpha_1, \alpha_2$, and any vectors $v_1$ and $v_2 \in V$.


A mapping of a vector space to a vector space is often called a 
transformation, and when the mapping is a morphism it is called a linear 
transformation. This is another example of calling a particular type of 
mapping by a special name. 


The significance of a linear transformation is that, given a basis for the 
domain, it is sufficient to know the images of the basis vectors—the images 
of all other vectors must follow. For example, suppose V is a vector space 
of dimension 3, with a basis {a, b, c}; then every vector $v \in V$ can be expressed
as


$v = \alpha a + \beta b + \gamma c$

for suitable scalars $\alpha, \beta, \gamma \in R$.

Now if T is a linear transformation, then
$T(v) = \alpha T(a) + \beta T(b) + \gamma T(c)$,


so if we know the three image vectors T(a), T(b) and T(c), then we can
deduce the image T(v) for all $v \in V$.
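The remark that a linear transformation is fixed by the images of the basis vectors can be sketched in Python. Everything specific here is hypothetical: the images T(a), T(b), T(c) are invented pairs in $R^2$, and the helper names are our own.

```python
def add(u, w):
    """Componentwise sum of two tuples of equal length."""
    return tuple(p + q for p, q in zip(u, w))

def scale(c, u):
    """Scalar multiple of a tuple."""
    return tuple(c * p for p in u)

def image_from_basis(coords, basis_images):
    """Return alpha*T(a) + beta*T(b) + gamma*T(c).

    coords are the scalars (alpha, beta, gamma) expressing v in the basis;
    basis_images are the known images T(a), T(b), T(c).
    """
    result = tuple(0 for _ in basis_images[0])
    for c, img in zip(coords, basis_images):
        result = add(result, scale(c, img))
    return result

# Hypothetical images of the three basis vectors, taken in R^2.
T_a, T_b, T_c = (1, 0), (0, 1), (1, 1)

# The vector v = 2a + 3b - c then has the image 2T(a) + 3T(b) - T(c).
image_of_v = image_from_basis((2, 3, -1), [T_a, T_b, T_c])
print(image_of_v)
```

Only the three images and the coordinates of v are needed; no formula for T on a general vector is ever written down.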




Exercises 
1. Which of the following mappings are linear transformations? 
(i) The mapping of $R^2$ to $R^2$ such that
$T: (x_1, x_2) \mapsto (x_2, x_1)$
(ii) The mapping of $R^2$ to $R^2$ such that
$T: (x_1, x_2) \mapsto (x_1^2, x_2^2)$


(iii) The mapping of the set of all polynomial functions of degree n or
less to itself such that


$T: p \mapsto$ the derived function of $p$.


(iv) The mapping of the set of all real functions which are twice- 
differentiable at all points in R to the set of all functions with 
domain R, such that 


$T: f \mapsto 2f'' + f' + 3f$.


2. Let L be a linear transformation from a vector space V to a vector space
U. Complete the gaps in the proof of the following theorem.


THEOREM


If the zero element of V is $v_0$ (i.e. $v_0$ is the element for which $v + v_0 = v$
for any $v \in V$), and if $u_0$ is the zero element of U, then $L(v_0) = u_0$.


PROOF


Since $v + v_0 = v$,


$L(v + v_0) = L(\underline{\quad})$ (a)


But L is a linear transformation, so


$L(v + v_0) = L(\underline{\quad}) + L(\underline{\quad})$ (b)


From (a) and (b), $L(v) + L(v_0) = L(v)$, so, subtracting $L(v)$ from both
sides, we see that


$L(v_0)$ is the $\underline{\quad}$ vector of U. (c)


Confirm this result for each of the mappings which are linear 
transformations in Exercise 1. 


Exercise 2 shows that under a linear transformation the zero element in the 
domain vector space is mapped to the zero element in the codomain vector 
space.
Solutions 
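1. (i) $T((x_1, x_2) + (y_1, y_2)) = T(x_1 + y_1, x_2 + y_2)$

$= (x_2 + y_2, x_1 + y_1)$

$= (x_2, x_1) + (y_2, y_1)$

$= T(x_1, x_2) + T(y_1, y_2)$

and
$T(\alpha(x_1, x_2)) = T(\alpha x_1, \alpha x_2)$

$= (\alpha x_2, \alpha x_1)$

$= \alpha(x_2, x_1)$

$= \alpha T(x_1, x_2)$


so T is a morphism.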
(In Exercise 1, take the operations in the various vector spaces to be the usual ones.)


(ii) $\alpha T(x_1, x_2) = \alpha(x_1^2, x_2^2) = (\alpha x_1^2, \alpha x_2^2)$,
and


$T(\alpha(x_1, x_2)) = T(\alpha x_1, \alpha x_2) = (\alpha^2 x_1^2, \alpha^2 x_2^2)$.


Equation (2) is not satisfied, so T is not a morphism.
(iii) T is a morphism.
(iv) T is a morphism.
The results (iii) and (iv) follow directly from the properties of differentiation.
2. (a) $v$
(b) $v$, $v_0$
(c) zero


For the morphisms of Exercise 1, we have:
(i) $(0, 0) \mapsto (0, 0)$
(iii) $(x \mapsto 0) \mapsto (x \mapsto 0)$
(iv) $(x \mapsto 0) \mapsto (x \mapsto 0)$


You may have noticed that, for mappings which are not
one-one, the zero vector is not the only vector which maps to
the zero vector. For example, in (iii) we also have $(x \mapsto k)$
mapping to $(x \mapsto 0)$, where k is any real number.
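The algebra in solutions 1(i) and 1(ii) can be spot-checked numerically against Equation (3) of this sub-section. The sketch below is our own: the helper names and the sample values are arbitrary, and a successful check on samples does not prove linearity, although a single failure does disprove it.

```python
def T_swap(v):
    """Mapping (i): (x1, x2) -> (x2, x1)."""
    x1, x2 = v
    return (x2, x1)

def T_square(v):
    """Mapping (ii): (x1, x2) -> (x1^2, x2^2)."""
    x1, x2 = v
    return (x1 * x1, x2 * x2)

def add(u, w):
    return tuple(p + q for p, q in zip(u, w))

def scale(alpha, u):
    return tuple(alpha * p for p in u)

def satisfies_equation_3(T, a1, v1, a2, v2):
    """Compare T(a1*v1 + a2*v2) with a1*T(v1) + a2*T(v2)."""
    left = T(add(scale(a1, v1), scale(a2, v2)))
    right = add(scale(a1, T(v1)), scale(a2, T(v2)))
    return left == right

# Arbitrary sample scalars and vectors.
samples = [(2, (1, 2), -3, (4, 5)), (0, (7, 1), 1, (0, 0)), (-1, (3, 3), 2, (1, -2))]
swap_ok = all(satisfies_equation_3(T_swap, *s) for s in samples)
square_ok = all(satisfies_equation_3(T_square, *s) for s in samples)
print(swap_ok, square_ok)
```

The squaring map fails on the first sample already, in agreement with the conclusion that it is not a morphism.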


The following theorem is fundamental to the study of linear 
transformations. It provides the general framework for the three examples 
discussed in the preceding sub-section. 


Theorem 


If L is a linear transformation from a vector space V to a vector space U,
then L(V) is a subset of U which is itself a vector space.

Here L(V) denotes the set of images L(v) for all $v \in V$; it may be U or a
proper subset of U.


Method of Proof 


We have to prove that the set L(V), with the operations of the vector space 
U, satisfies the vector space axioms listed in Section 0.3. To save you the 
trouble of referring back, we list the axioms for a vector space V; $v, v_1, v_2, v_3$
are any elements of V and $\alpha$ and $\beta$ are any real numbers:


A1 $v_1 + v_2 \in V$ and is unique.
A2 $v_1 + (v_2 + v_3) = (v_1 + v_2) + v_3$.
A3 There is an element $v_0$ in V, such that
$v + v_0 = v$.
A4 Given $v \in V$, there is an element $-v \in V$ such that $v + (-v) = v_0$.
A5 $v_1 + v_2 = v_2 + v_1$.

B1 $\alpha v \in V$.
B2 $(\alpha\beta)v = \alpha(\beta v)$.
B3 $(\alpha + \beta)v = \alpha v + \beta v$.
B4 $\alpha(v_1 + v_2) = (\alpha v_1) + (\alpha v_2)$.
B5 $1v = v$.


Axioms A2, A5, B2, B3, B4 and B5 are statements about all elements of a
vector space, and since U is a vector space we do not have to check these
axioms for L(V). On the other hand, axiom A3 is a statement that a
particular kind of element (the ZERO element) belongs to a vector space.
Clearly, the zero element of U will not belong to every subset of U, so we
have to prove that it belongs to the particular subset L(V). Axioms A1 and
B1 concern CLOSURE. If L(V) is to be a vector space, then any combination of
elements in L(V) must give resulting elements still in L(V). This again is not
necessarily true for any subset of U, so we must check it for L(V). Once we
have proved B1, we shall not need to prove A4, because we always have
$(-1)v = -v$.


Proof 


We have to prove that axioms A1, A3 and B1 hold for L(V). We have three
pieces of information:


(i) V is a vector space;
(ii) U is a vector space;
(iii) L is a linear transformation.


We have used (ii) to dispose of axioms A2, A5, B2, B3, B4 and B5 but we have
not yet used (i) and (iii).


We have proved that axiom A3 holds for L(V): in Exercise 2 we proved that
under a linear transformation the zero vector $v_0$ in the domain V maps to
the zero vector $u_0$ in the codomain U. So the zero vector of U belongs to
L(V).


Let us have a look at axiom A1; we must show that L(V) is closed under
addition. If $u_1$ and $u_2$ are any elements of L(V), then there are elements $v_1$
and $v_2$ in V such that


$u_1 = L(v_1)$
$u_2 = L(v_2)$.
Then
$u_1 + u_2 = L(v_1) + L(v_2)$
$= L(v_1 + v_2)$ (because L is a linear transformation)
$= L(v_3)$


where $v_3 = v_1 + v_2$ is an element of V (by axiom A1 for the vector space V): $L(v_3)$ is an
element of L(V), so $u_1 + u_2$ belongs to L(V), and L(V) is closed under addition.


The other closure axiom B1 is easily checked.


If
$u = L(v)$,
then
$\alpha u = \alpha L(v)$
$= L(\alpha v)$ (because L is a linear transformation)


Therefore $\alpha u \in L(V)$.


This completes the proof.

If a subset of a vector space U is itself a vector space, then we call it a vector
subspace of U. So we have shown in the theorem that the image of a linear
transformation is a vector subspace of the codomain.

Note that we have not proved that if $T: V \to U$ is not a morphism, then
T(V) is not a vector space. For a general mapping T, we do not know
anything about T(V) if T is not a morphism.
Exercise 


3. (i) The mapping from $R^2$ to $R^2$ defined by
$L: (x_1, x_2) \mapsto (-x_2, x_2)$


is a linear transformation. Prove directly, by verifying the axioms,
that $L(R^2)$ is a vector space.


(ii) The mapping from $R^2$ to $R^2$ defined by
$T: (x_1, x_2) \mapsto (x_1^2, x_2)$


is not a linear transformation. Show that $T(R^2)$ is not a vector
space by finding an axiom which is not satisfied.


Solution 


3. (i) As in the proof of the theorem, we need to check axioms
A1, A3 and B1 only.


A1 The elements of $L(R^2)$ are of the form
$(-a, a)$, $a \in R$
and
$(-a, a) + (-b, b) = (-(a + b), a + b)$,


and so axiom A1 is satisfied.
A3 Consider the element (0, 0):


$L((0, 0)) = (0, 0)$,


so that $(0, 0) \in L(R^2)$ and axiom A3 is satisfied.
B1 Any element of $L(R^2)$ is of the form $(-a, a)$ for
some $a \in R$;


$\alpha(-a, a) = (-\alpha a, \alpha a)$,


and so $\alpha(-a, a) \in L(R^2)$: axiom B1 is satisfied.
(ii) The only axioms which may not be satisfied are A1, A3
and B1. Of these only B1 is not satisfied.


$\alpha(x_1^2, x_2) = (\alpha x_1^2, \alpha x_2)$


and if $\alpha$ is negative and $x_1 \neq 0$, then $\alpha x_1^2$ is also negative, and so cannot be
written as the square of a real number. We see therefore
that for the given function $T: R^2 \to R^2$, the conclusion of
the theorem is violated; we may therefore deduce that T is
not a linear transformation.


So far in the text, we have considered mappings of one vector space to 
another and we have concentrated our attention on linear transformations, 
i.e., those mappings which are morphisms. A linear transformation has the 
property that the image set itself is a vector space. 


An interesting feature of some of the morphisms we have met is that they 
map a vector space on to an image set which has a lower dimension. For 
example, we have had mappings of planes to lines, polynomials of degree n
or less to polynomials of degree n - 1 or less, and so on. Two questions
arise. What has happened to the "lost" dimensions? Can we predict in
advance when we are going to "lose" a dimension? We shall look at these 
questions in the next sub-section. 


0.4.3 The Kernel 


Let us have another look at the linear transformation
$L: (x, y) \mapsto (-y, y)$ for $(x, y) \in R^2$


which maps the plane $R^2$ to a line. In sub-section 0.4.1, we looked at a
particular basis, and saw that one of the basis vectors mapped to the zero
element (0, 0) in the codomain. We then investigated the mapping by
looking to see what happened to particular subsets of the plane. We saw
that any line parallel to the x-axis mapped to a single point.


(Note on Exercise 3(ii) of sub-section 0.4.2: the subset of $R^2$ which we
encountered as $T(R^2)$ in the exercise is in fact the half-plane

$\{(x_1, x_2): x_1 \geq 0\}$.

It is a convenient example of a subset which is not a vector subspace.)


This raises two questions. Firstly, is it significant that we lose one basis 
vector and we lose one dimension? Secondly, it is all very well in this simple 
case to pick out a few significant subsets that tell us such a lot. We picked 
them out because we knew their properties. Consider now the images of the 
lines parallel to the y-axis in the domain. Any such line maps to the entire 
image set, for suppose we take the line for which x = a, then 


$L: (a, y) \mapsto (-y, y)$ $(y \in R)$.


By considering certain subsets of the domain, we find that we can obtain 
information about L. Is there any particular subset which we can most 
profitably consider? That is, can we describe a subset in the domain which 
will give us information about L in a form which we can interpret easily? If 
so, can we extract any general feature which will help us with other 
examples? 


The clue is in our observation about the loss of a basis vector. The vector 
(1,0) maps to (0, 0), but of course it is not only this vector which "shrinks" to 
zero—so does every multiple of (1,0). So a whole set maps to (0,0). Why 
consider this particular set? Try the next exercise. 


Exercise 


1. The vectors (0, 1) and (2, 2) form a basis for $R^2$. Calculate L(0, 1) and
L(2, 2), where L is the mapping we have been discussing:


$L: (x, y) \mapsto (-y, y)$ $((x, y) \in R^2)$.
What happens to the linear independence of (0, 1) and (2, 2)?


Solution
1. $L(0, 1) = (-1, 1) \neq (0, 0)$
and
$L(2, 2) = (-2, 2) \neq (0, 0)$
but
$-2L(0, 1) + L(2, 2) = (0, 0)$.
Although neither vector maps to zero, the pair of linearly
independent vectors maps to a pair of dependent vectors. So
although the original vectors form a basis for $R^2$, their images
do not.
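The arithmetic of this solution can be reproduced in a short sketch. The determinant test used here for the independence of two pairs is a standard device that the unit itself has not introduced, and all the names are our own.

```python
def L(v):
    """The mapping (x, y) -> (-y, y)."""
    x, y = v
    return (-y, y)

def combine(a, u, b, w):
    """Return the linear combination a*u + b*w of two pairs."""
    return (a * u[0] + b * w[0], a * u[1] + b * w[1])

def det(u, w):
    """2x2 determinant; it is zero exactly when u and w are linearly dependent."""
    return u[0] * w[1] - u[1] * w[0]

v1, v2 = (0, 1), (2, 2)
image_1, image_2 = L(v1), L(v2)

# The relation found in the solution: -2 L(0, 1) + L(2, 2) = (0, 0).
relation = combine(-2, image_1, 1, image_2)
print(image_1, image_2, relation, det(v1, v2), det(image_1, image_2))
```

The original pair has a non-zero determinant, while the pair of images has determinant zero.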


Exercise 1 shows us that, in our original basis, the choice of a vector which 
mapped to (0, 0) was purely fortuitous. It may so happen, as in this exercise, 
that none of the basis vectors maps to (0,0), even though we “lose” a 
dimension. But the whole set $\{(x, 0): x \in R\}$ maps to (0, 0) whether or not one
of its elements is in the basis. 


It seems then that the set which maps to (0, 0) tells us something about the
“lost” dimension. In this case the set which maps to (0, 0) has dimension 1
(every element of the set can be obtained as a scalar multiple of the vector
i = (1, 0)), and we lose just one dimension. (Because the mapping is a linear
transformation, $L(\alpha(1, 0)) = \alpha L(1, 0) = (0, 0)$.) Let us have a look at two more
examples, one where we again lose one dimension and one where we lose
more than one dimension. We shall again take examples which can be
modelled geometrically because geometrical situations are easy to visualize.


Example 1
The mapping
$L: (x, y, z) \mapsto (x, y, 0)$


is a linear transformation from $R^3$ to $R^3$. The image of any point P is the
point at the foot of the perpendicular from P to the plane with equation
z = 0. Thus the domain maps to a plane.


[Figure: a point P(x, y, z) and the foot of the perpendicular from P to the plane z = 0.]


L maps a 3-dimensional space to a 2-dimensional space: we lose one
dimension. The set which maps to the zero element (0, 0, 0) in the codomain
is the set $\{(0, 0, z): z \in R\}$, that is, the z-axis. This set is itself a vector space
and its dimension is one, the same number as the number of "lost"
dimensions.
Example 2
The mapping

$L: (x, y, z) \mapsto (x, 2x, 0)$


is a linear transformation from $R^3$ to $R^3$. The image of the point (x, y, z)
depends only on its x-coordinate. Thus (1, 2, 3), (1, 6, 7), (1, 6, 99) all map to
the point (1, 2, 0). Every point in the plane with equation x = 1 maps to this
point.


[Figure: the plane with equation x = 1 mapping to the point (1, 2, 0).]


Similarly, every point on the plane with equation x = 2 maps to the point
(2, 4, 0) and so on. Every plane perpendicular to the x-axis maps to a point
on the line defined by the equations:


2x - y = 0
z = 0,


and the entire three-dimensional space maps to this complete line. (In
three-dimensional Cartesian space, two linear equations are required to
determine a line.)


Thus
$L(R^3) = \{(x, y, z): 2x - y = 0, z = 0\}$.


The three-dimensional domain maps to a space of dimension 1. We seem to 
have lost two dimensions. 


Which set maps to the zero element? In this case the zero element is (0, 0, 0), 
and the set which maps to (0, 0, 0) is the set {(x, y, z): x = 0}, i.e. the yz-plane: 
this is itself a vector space and has dimension two. Notice that in this 
simple case we can use the “basis argument” again—if we can find an 
appropriate basis. Taking the set of vectors {(1,0, 0), (0, 1, 0), (0,0, 1)} as a 
basis, we see that both (0, 1, 0) and (0, 0, 1) map to (0,0, 0), and so we “lose” 
two vectors from the basis. In fact we “lose” any vector which can be 
expressed as a linear combination of these two vectors (i.e. the points in the 
plane with equation x = 0), because


$L(\alpha(0, 1, 0) + \beta(0, 0, 1)) = \alpha L(0, 1, 0) + \beta L(0, 0, 1)$
$= (0, 0, 0)$,
since L is a linear transformation. 


We have seen that the subset of the domain which maps to the zero vector in 
the codomain plays an important part, so we now give it a name. 


If L is a linear transformation from a vector space V to a vector space U, and
if $0_U$ is the zero element in U, then the set


$\{v: v \in V, L(v) = 0_U\}$


is called the kernel of L. (Another name which is in common use for this set 
is the null space.) We shall denote the kernel by the letter K. 


There is one important point to notice here. In sub-section 0.3.3 we defined 
a basis of a vector space V to be a linearly independent set of vectors in V 
which spans V. We defined the dimension of V to be the number of elements
in a basis. Now the kernel of a linear transformation might only contain the
zero element of V; i.e. we may have K = {0}.


We write 0 instead of $0_V$ here, because you may like to refer to our
discussion of the vector space {0}, where we mentioned that


$\alpha 0 = 0$,


where $\alpha$ is any real number. This means that {0} is a linearly dependent set,
that is {0} does not possess a basis. We adopt the following definition.


The dimension of the zero vector space, {0}, is zero. 




The kernel has some quite remarkable properties. We have already hinted 
at two of them which are printed in red below. 


(1) The kernel itself is a vector space.


We have shown that L(V) is itself a vector space, and in Examples 1 and 2
we have seen that


(2) (dimension of L(V)) = (dimension of V) - (dimension of kernel).
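Statement (2) can be checked for the mappings met so far by a small computation. The sketch below rests on assumptions that go beyond the text: each mapping is written as a matrix of coefficients (one row per component of the image), the dimension of the image is taken to be the number of pivots found by row reduction, and a basis of the kernel is read off from the free columns. The function names are our own.

```python
from fractions import Fraction

def rref(rows):
    """Reduce a matrix to reduced row echelon form.

    Returns the reduced rows and the list of pivot column indices.
    """
    m = [[Fraction(x) for x in row] for row in rows]
    pivots = []
    r = 0  # index of the next pivot row
    for c in range(len(m[0])):
        if r == len(m):
            break
        pivot_row = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if pivot_row is None:
            continue  # no pivot in this column
        m[r], m[pivot_row] = m[pivot_row], m[r]
        pv = m[r][c]
        m[r] = [x / pv for x in m[r]]
        for i in range(len(m)):
            if i != r and m[i][c] != 0:
                factor = m[i][c]
                m[i] = [a - factor * b for a, b in zip(m[i], m[r])]
        pivots.append(c)
        r += 1
    return m, pivots

def kernel_basis(rows):
    """One basis vector of the kernel for each column without a pivot."""
    m, pivots = rref(rows)
    n = len(rows[0])
    basis = []
    for free in (c for c in range(n) if c not in pivots):
        vec = [Fraction(0)] * n
        vec[free] = Fraction(1)
        for row_index, p in enumerate(pivots):
            vec[p] = -m[row_index][free]
        basis.append(tuple(vec))
    return basis

def dimensions(rows):
    """Return (dimension of image, dimension of domain, dimension of kernel)."""
    _, pivots = rref(rows)
    return len(pivots), len(rows[0]), len(kernel_basis(rows))

example_1 = [[1, 0, 0], [0, 1, 0], [0, 0, 0]]   # (x, y, z) -> (x, y, 0)
example_2 = [[1, 0, 0], [2, 0, 0], [0, 0, 0]]   # (x, y, z) -> (x, 2x, 0)
plane_to_line = [[0, -1], [0, 1]]               # (x, y) -> (-y, y)

for rows in (example_1, example_2, plane_to_line):
    dim_image, dim_domain, dim_kernel = dimensions(rows)
    print(dim_image, dim_domain, dim_kernel, dim_image == dim_domain - dim_kernel)
```

For Example 2 the kernel basis returned is (0, 1, 0) and (0, 0, 1), the two vectors "lost" from the basis in the discussion above.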


Exercises 
2. Find the kernel of each of the following linear transformations.
(i) $T: (x_1, x_2, x_3) \mapsto (x_1, x_2, 0)$
where T maps $R^3$ to $R^3$.
(ii) $T: (x_1, x_2) \mapsto (x_1 + x_2, x_1 - 2x_2)$
where T maps $R^2$ to $R^2$.


In each case find the dimension of the kernel and verify statement 2 in
the text.


3. Let L be a linear transformation from a vector space V to a vector space
U. Show that the kernel of L is a vector subspace of V.


HINT: How many of the axioms of a vector space need proving for the 
kernel? (See sub-section 0.4.2.) 


Solutions 


2. (i) $\{(0, 0, x_3): x_3 \in R\}$. Any element in this set is a scalar
multiple of (0, 0, 1).


(ii) $\{(x_1, x_2): x_1 + x_2 = 0 \text{ and } x_1 - 2x_2 = 0\}$.
The pair of simultaneous equations
$x_1 + x_2 = 0$
$x_1 - 2x_2 = 0$
has the single solution $x_1 = 0$, $x_2 = 0$. Thus the kernel is
the set {(0, 0)}.
The dimensions of the kernels are as follows:


(i) 1. The dimension of the domain is 3, the dimension of its
image set is 2, and 3 - 2 = 1.


(ii) 0. The dimension of both the domain and its image is 2.
Note that, by defining the dimension of {0} to be zero, we
have ensured that statement 2 is satisfied when K = {0}.


3. We need only prove that the elements of the kernel satisfy
axioms A1, A3 and B1. As usual, we denote the kernel by K.


A1 If $k_1$ and $k_2$ belong to K, we want to prove that
$k_1 + k_2 \in K$. We recognize elements of K by the fact that
under L they map to $0_U$, the zero element in U.


$L(k_1 + k_2) = L(k_1) + L(k_2)$ (L is a linear transformation)
$= 0_U + 0_U$ ($k_1, k_2 \in K$)
$= 0_U$ (axiom A3 for U)


Therefore, $k_1 + k_2 \in K$.
A3 We have already shown that $L(0) = 0_U$.
Therefore, $0 \in K$.




B1 Let $k \in K$, then
$L(\alpha k) = \alpha L(k)$ (L is a linear transformation)
$= \alpha 0_U$ ($k \in K$)
$= 0_U$ (property of the zero vector)


The verification of axioms A1 and B1 can be amalgamated by
using the definition of a linear transformation in the form
given in Equation (3) of sub-section 0.4.2.


0.4.4 Properties of the Kernel 
It is remarkable how much we can tell about a linear transformation just by
considering its kernel. 


Let L be a linear transformation from a vector space V to a vector space U,
and let K be its kernel. If $k \in K$ and $v \in V$, then


$L(v + k) = L(v) + L(k)$ (L is a morphism)
$= L(v) + 0_U$ (definition of K)
$= L(v)$ (axiom A3 for U),


where $0_U$ is the zero element in U. So v and v + k, where k is any element of
the kernel, have the same image.


Suppose now that we want to find all the elements in V which map to a given 
element $u \in U$, and that we know one such element v, i.e.


$L(v) = u$.


Then we know immediately that v + k for all $k \in K$ are such elements, and
the remarkable thing is that they are in fact all the elements which map to u.
We can prove this as follows. Suppose $v_1 \in V$ maps to u, i.e. $L(v_1) = u$. Then
consider $v_1 - v$. We have


$L(v_1 - v) = L(v_1 + (-v))$ (definition of subtraction)
$= L(v_1) + L(-v)$ (L is linear)
$= L(v_1) - L(v)$ (L is linear)
$= u - u$ (hypothesis)
$= 0_U$ (axiom A4 for U).


But the kernel K contains all those elements which map to $0_U$, so

$v_1 - v = k_1$,
for some $k_1 \in K$. By axiom A1 for V, $k_1$ is unique. Adding v to both sides, we
get

$v_1 = v + k_1$,
and so $v_1$ is of the form v + some element of the kernel. This result has
important consequences: for instance, if the kernel contains n linearly
independent elements $k_1, \ldots, k_n$ then we know that if $v \in V$ maps to any
given element of L(V), then so also do all vectors of the form

$v + \lambda_1 k_1 + \lambda_2 k_2 + \cdots + \lambda_n k_n$.


Furthermore, if the kernel contains just one element (which will necessarily 
be the zero element in V), then we know immediately that L is one-one, i.e. 
an isomorphism. The following example applies this discussion. 




Example 1 

Apply the above ideas to finding the solution of the equations
2x + 3y - z = 1
x + y - z = 2


in terms of vector spaces.


Solution of Example 1


One way of expressing the problem is to say that we want to find the set of
triples (x, y, z) which satisfy these equations. If L is the mapping from $R^3$ to
$R^2$ defined by


$L: (x, y, z) \mapsto (2x + 3y - z, x + y - z)$,


then we want to find the set which maps to (1, 2).


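[Figure: the mapping L from the domain $R^3$ to the codomain $R^2$, with the set which maps to (1, 2) shaded red.]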


We have seen that we can describe the set which maps to any particular 
element when we know the kernel and one element of the set. In the context 
of this example, this means that, if we can find one solution to the equations, 
we shall be able to find all the solutions, simply by adding to that solution 
each element of the kernel. So we have to find one solution and we have to 
find the kernel. 

One Solution 


If we give x or y or z a particular value, then the equations will be reduced to
two equations in two unknowns, which we can solve easily.


For example, if we put z = 0, then we obtain the two equations
2x + 3y = 1
x + y = 2,
which we can solve to give
x = 5, y = -3.
So one solution of the original equations
2x + 3y - z = 1
x + y - z = 2
is
x = 5, y = -3, z = 0.


The Kernel 


The kernel, K, is the set of triples (x, y, z) which map to (0, 0), i.e. which
satisfy the equations


2x + 3y - z = 0
x + y - z = 0.

(We call these the associated homogeneous equations; the solution x = 5, y = -3, z = 0 of the original equations is called a particular solution.)


Just as before, we can solve these equations by giving x or y or z a particular
value and then trying to solve the resulting two equations in two unknowns;
but this time it is not much help, because we simply get the one solution, and
we want all solutions. But if, for example, we give z a general value and put
z = k, then these equations become
2x + 3y = k
x + y = k,
which we can solve to give
x = 2k, y = -k.


So one element of the kernel is (2k, -k, k), and by varying k we get all the
elements. So K is the set


$\{(2k, -k, k): k \in R\}$.


We want to find the set shaded red in the above diagram. 
We know one element—how do we find the others? 


Any solution of the original equations 


2x + 3y - z = 1
x + y - z = 2
is obtained by adding an element of the kernel to (5, -3, 0).
So the complete solution set is
$\{(5 + 2k, -3 - k, k): k \in R\}$,


and the theory we have developed assures us that this set contains all the
possible solutions to the original equations. We say that
x = 5 + 2k, y = -3 - k, z = k
is the general solution.
(Check that these are solutions by substituting into the original equations.)
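The suggested check can also be carried out mechanically. In this sketch the function names and the range of sample values of k are our own choices; the mapping, the particular solution and the kernel are those found in the example.

```python
def L(v):
    """The mapping (x, y, z) -> (2x + 3y - z, x + y - z) of the example."""
    x, y, z = v
    return (2 * x + 3 * y - z, x + y - z)

particular = (5, -3, 0)

def kernel_element(k):
    """The element (2k, -k, k) of the kernel."""
    return (2 * k, -k, k)

def general_solution(k):
    """Particular solution plus a kernel element: (5 + 2k, -3 - k, k)."""
    return tuple(p + q for p, q in zip(particular, kernel_element(k)))

sample_ks = range(-5, 6)
kernel_ok = all(L(kernel_element(k)) == (0, 0) for k in sample_ks)
solutions_ok = all(L(general_solution(k)) == (1, 2) for k in sample_ks)
print(L(particular), kernel_ok, solutions_ok)
```

Substituting a few values of k in this way confirms that each triple is a solution; that there are no other solutions is what the theory of this sub-section guarantees.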
Notice that we can solve related problems like 
2x + 3y - z = 27
x + y - z = 99,

where the right-hand sides of the equations are changed, very quickly—all 
we need to do is to find one particular solution of these equations and then 
add on each of the elements of the (same) kernel. We shall discuss problems 
of this type in considerable detail in Unit 3, Hermite Normal Form. 
Exercises 
1. By putting x = 0, find a particular solution of the equations

2x + 3y - z = 2

x + y + z = 1.
Find the solution set of the equations.


2. We can map the vector space $P_n$, of all polynomial functions of degree
less than n, to itself by the differentiation operator:


$D: p \mapsto p'$ $(p \in P_n)$.
We have already seen that this mapping is linear. What is the kernel?


What significance does this have in integration?




Solutions 


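1. Putting x = 0 in both equations gives
3y - z = 2
y + z = 1
which have the solution
$y = \tfrac{3}{4}$, $z = \tfrac{1}{4}$.
Thus one solution is
$x = 0$, $y = \tfrac{3}{4}$, $z = \tfrac{1}{4}$.
To find the kernel, we have to solve the equations
2x + 3y - z = 0
x + y + z = 0.
If we put x = k, we get
3y - z = -2k
y + z = -k,
which have the solution
$y = -\tfrac{3}{4}k$, $z = -\tfrac{1}{4}k$;
so the kernel is the set
$\{(k, -\tfrac{3}{4}k, -\tfrac{1}{4}k): k \in R\}$.
The complete solution is therefore given by
$\{(k, \tfrac{3}{4} - \tfrac{3}{4}k, \tfrac{1}{4} - \tfrac{1}{4}k): k \in R\}$.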


2. The kernel is the set of all polynomial functions which map to
the zero function


$x \mapsto 0$ for $x \in R$;
that is, the set of all constant functions,
$\{f: x \mapsto k \ (x \in R), \text{ where } k \in R\}$.


The problem of integration is that of solving equations of the
form


$D(p) = g$
where g is given.


Since the kernel contains an infinite number of elements, the
integration problem has an infinite number of solutions. If p is
one solution, then the set of all solutions is


$\{p + f: f: x \mapsto k \ (x \in R), \text{ where } k \in R\}$.


Problems such as the following often arise in mathematics. We are given a 
mapping, M say, from a set A to a set B, and are required to find all the 
elements x of A such that 


$M: x \mapsto b$,


where b is a given element of B. That is to say, we have to solve the equation 
M(x) = b. Sometimes there is no solution (if b does not belong to M(A)); 
sometimes there is a unique solution; sometimes there are many solutions. 
In a very wide class of problems, A and B are vector spaces and M is a linear 
transformation. We have just seen that in these cases we can (in principle) 
adopt a standard procedure. We first find the kernel, K, that is, the solution
set of


M(x) = 0.

(This is usually a much easier problem than the original one.)


We then find any one particular solution of
M(x) = b,


and then combine this with each element of K, to get the complete solution
set.


(Note on Exercise 1 of this sub-section: the equations 2x + 3y - z = 2 and
x + y + z = 1 are equations of planes. The set of points

$\{(k, \tfrac{3}{4} - \tfrac{3}{4}k, \tfrac{1}{4} - \tfrac{1}{4}k): k \in R\}$

corresponds to the line formed by the intersection of these two planes.)


We shall apply these techniques later in the course.


0.4.5 Summary of Section 0.4 


In this section we defined the terms 


morphism (page 39) 
linear transformation (page 40) 
subspace (page 43) 
kernel (page 47) 
associated homogeneous equations (page 50) 


We introduced the notation 


L(V) (page 42) 
K (page 47)
$P_n$ (page 51)




0.5 SUMMARY OF THE UNIT 


We have seen how the concept of a vector space may be regarded as having 
its origins in geometry. Using the concept of a geometric vector, we were 
able to construct an algebraic structure by introducing “addition” and 
“multiplication by a scalar” on the set of all geometric vectors in two (or 
three) dimensions. We then saw that we could construct a very similar 
algebraic structure on the set of ordered pairs (or triples) of real numbers. 
There are in fact many different mathematical systems which have the same 
structure, and we chose to extract the important properties, then to study 
the abstract structure which possesses these properties. Such a structure 
we called a vector space. The concept of a vector space is of fundamental 
importance; it is so important that the whole course is built around it. 


We then discussed what happens when vector spaces are mapped to vector 
spaces. This led us to the concept of a linear transformation. 


In the next two units of the course, you will meet again much of what we 
have done on vector spaces and linear transformations. The subject will be 
put on firm foundations and many of the unproved statements in this unit 
will be verified. 




LINEAR MATHEMATICS 


0 Linear Algebra
1 Vector Spaces
2 Linear Transformations
3 Hermite Normal Form
4 Differential Equations I
5 Determinants and Eigenvalues
6 NO TEXT
7 Introduction to Numerical Mathematics: Recurrence Relations
8 Numerical Solution of Simultaneous Algebraic Equations
9 Differential Equations II: Homogeneous Equations
10 Jordan Normal Form
11 Differential Equations III: Nonhomogeneous Equations
12 Linear Functionals and Duality
13 Systems of Differential Equations
14 Bilinear and Quadratic Forms
15 Affine Geometry and Convex Cones
16 Euclidean Spaces I: Inner Products
17 NO TEXT
18 Linear Programming
19 Least-squares Approximation
20 Euclidean Spaces II: Convergence and Bases
21 Numerical Solution of Differential Equations
22 Fourier Series
23 The Wave Equation
24 Orthogonal and Symmetric Transformations
25 Boundary-value Problems
26 NO TEXT
27 Chebyshev Approximation
28 Theory of Games
29 Laplace Transforms
30 Numerical Solution of Eigenvalue Problems
31 Fourier Transforms
32 The Heat Conduction Equation
33 Existence and Uniqueness Theorem for Differential Equations




